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POLYPEPTIDE-POLYMER CONJUGATES HAVING ADDED AND/OR REMOVED ATTACHMENT GROUPS 

FIELD OF THE INVENTION 

The present invention relates to polypeptide-polymer 
5 conjugates having added and/ or removed one or more attachment 
groups for coupling polymeric molecules on the siirface of the 3D 
structure of the polypeptide, a method for preparing polypeptide- 
polymer conjugates of the invention, the use of said conjugated 
for reducing the ixnmunogenicity and allergenicity, and 
10 compositions comprising said conjugate. 

BACKGROUND OF THE INVENTION 

The use of polypeptides, including enzymes, in the 
circulatory system to obtain a particular physiological effect is 
15 well-known in the medical arts. Further, within the arts of 
industrial applications, such as laundry washing, textile 
bleaching, person care, contact lens cleaning, food and feed 
preparation enzymes are used as a functional ingredient. One of 
the important differences between pharmaceutical and industrial 
20 application is that for the latter type of applications (i.e. 
industrial applications) the polypeptides (often enzymes) are not 
intended to enter into the circulatory system of the body. 

Certain polypeptides and enzymes have an unsatisfactory 
stability and may under certain circumstances - dependent on the 
25 way of challenge - cause an immune response, typically an IgG 
and/ or IgE response. 

It is today generally recognized that the stability of 
polypeptides is improved and the immune response is reduced when 
polypeptides, such as enzymes, are coupled to polymeric molecules. 
30 It is believed that the reduced immxine response is a result of the 
shielding of (the) epitope (s) on the surface of the polypeptide 
responsible for the immune response leading to antibody formation 
by the coupled polymeric molecules. 

Techniques for conjugating polymeric molecules to polypeptides 
35 are well-known in the art. 

One of the first suitable commercially techniques was described 
back in the early 1970 'ies and disclosed in e.g. US patent no. 
4,179,337. Said patent concerns non-immunogenic polypeptides, such 
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as enzymes and peptide hormones coupled to polyethylene glycol 
(PEG) or polypropylene glycol (PPG). At least 15% of polypeptides' 
physiological activity is maintained. 

GB patent no. 1,183^257 (Crook et al.) describes chemistry for 
5 conjugation of enzymes to polysaccharides via a triazine ring. 

Fiirther, techniques for maintaining of the enzymatic activity 
of enzyme-polymer conjugates are also known in the art. 

WO 93/15189 (Veronese et al.) concerns a method for maintaining 
the activity in polyethylene glycol-modif ied proteolytic enzymes 
10 by linking the proteolytic enzyme to a macromolecularized 
inhibitor. The conjugates are intended for medical applications. 

It has been fo\ind that the attachment of polymeric molecules to 
a polypeptide often has the effect of reducing the activity of the 
polypeptide by interfering with the interaction between the 
15 polypeptide and its substrate. EP 183 503 (Beecham Group PLC) 
discloses a development of the above concept by providing 
conjugates comprising pharmaceutically useful proteins linked to 
at least one water-soluble polymer by means of a reversible 
linking group. 

20 EP 471,125 (Kanebo) discloses skin care products comprising a 
parent protease (Bacillus protease with the trade name Esperase®) 
coupled to polysaccharides through a triazine ring to improve the 
thermal and preservation stability. The coupling technique used is 
also described in the above mentioned GB patent no. 1,183,257 

25 (Crook et al. ) . 

JP 3083908 describes a skin cosmetic material which 
contains a transglutaminase from guinea pig liver modified with 
one or more water-soluble substance such as PEG, starch, 
cellulose etc. The modification is performed by activating the 

30 polymeric molecules and coupling them to the enzyme. The 
composition is stated to be mild to the skin. 

However, it is not always possible to readily couple 
polymeric molecules to polypeptides and enzymes. Further, there is 
still a need for polypeptide-polymer conjugates with an even more 

35 reduced immunogenicity and/ or allergenicity. 



SUMMARY OF THE IMVEMTION 

It is the object of the present invention to provide improved 
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polypeptide-polymer conjugates suitable for industrial and 

pharmaceutical applications • 

The term "improved polypeptide-polymer conjugates" means in the 

context of the present invention conjugates having a reduced 
5 immiine response in hxmans and animals and/or a improved stability. 

As will be described further below the immune response is 

dependent on the way of challenge. 

The present inventors have found that polypeptides, such as 

enzymes, may be made less immunogenic and/or allergenic by adding 
10 and/or removing one or more attachment groups on the surface of 

the parent polypeptide to be coupled to polymeric molecules* 

When introducing pharmaceutical polypeptide directly into the 

circulatory system {i.e. bloodstream) the potential risk is an 

immunogenic response in the form of mainly IgG, IgA and/ or IgM 
15 antibodies. In contrast hereto, industrial polypeptides, such as 

enzymes used as a functional ingredient in e.gr. detergents, are 

not intended to enter the circulatory system. The potential risk 

in connection with industrial polypeptides is inhalation causing 

an allergenic response in the form of mainly IgE antibody 
20 formation. 

Therefore, in connection with industrial polypeptides the 
potential risk is respiratory allergenicity caused by inhalation, 
intratracheal and intranasal presentation of polypeptides. 

The main potential risk of pharmaceutical polypeptides is 
25 immunogenicity caused by intradermal ly, intravenously or subcu- 
taneous ly presentation of the polypeptide. 

It is to be understood that reducing the "immunogenicity" 
and reducing the "respiratory allergenicity" are two very 
different problems based on different routes of exposure and on 
30 two very different immunological mechanisms: 

The term "immunogenicity" used in connection with the 
present invention may be referred to as allergic contact 
dermatitis in a clinical setting and is a cell mediated delayed 
immune response to chemicals that contact and penetrate the skin. 
35 This cell mediated reaction is also termed delayed contact 
hypersensitivity (type IV reaction according to Gell and Combs 
classification of immune mechanisms in tissue damage) . 

The term "allergenicity" or "respiratory allergenicity" is an 
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immediate anaphylactic reaction (type I antibody-mediated reaction 
according to Gell and Combs) following inhalation of e.g. 
polypeptides. 

According to the present invention it is possible to provide 
5 polypeptides with a reduced immune response and/ or improved 
stability, which has a substantially retained residual activity. 

The allergic and the immunogenic response are in one term, at 
least in the context of the present invention called the "immune 
response" . 

10 In the first aspect the invention relates to a polypeptide- 
polymer conjugate having 

a) one or more additional polymeric molecules coupled to the 
polypeptide having been modified in a manner to increase the 
number of attachment groups on the surface of the polypeptide in 

15 comparison to the number of attachment groups available on the 
corresponding parent polypeptide, and/ or 

b) one or more fewer polymeric molecules coupled to the 
polypeptide having been modified in a manner to decrease the 
number of attachment groups at or close to the functional site(s) 

20 of the polypeptide in comparison to the number of attachment 
groups available on the corresponding parent polypeptide. 

The term "parent polypeptide" refers to the polypeptide to be 
modified by coupling to polymeric molecules. The parent 
polypeptide may be a naturally-occurring (or wild-type) 

25 polypeptide or may be a variant thereof prepared by any suitable 
means. For instance, the parent polypeptide may be a variant of a 
naturally-occurring polypeptide which has been modified by 
substitution, deletion or truncation of one or more amino acid 
residues or by addition or insertion of one or more amino acid 

3 0 residues to the amino acid sequence of a naturally-occurring 
polypeptide . 

A "suitable attachment group" means in the context of the 
present invention any amino acid residue group on the surface of 
the polypeptide capable of coupling to the polymeric molecule in 
35 question. 

Preferred attachment groups are amino groups of Lysine 
residues and the N-terminal amino group. Polymeric molecules may 
also be coupled to the carboxylic acid groups (-COOH) of amino 
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acid residues in the polypeptide chain located on the surface. 
Carboxylic acid attachment groups may be the carboxylic acid group 
of Aspartate or Glutamate and the C-terminal COOH-group. 

A "functional site" means any amino acid residues and/ or 
5 cofactors which are known to be essential for the performance of 
the polypeptide, such as catalytic activity, e.g. the catalytic 
triad residues, Histidine, Aspartate and Serine in Serine 
proteases, or e.g. the heme group and the distal and proximal 
Histidines in a peroxidase such as the /irthromyces ramosus 
10 peroxidase. 

In the second aspect the invention relates to a method for 
preparing improved polypeptide-polymer conjugates comprising the 
steps of: 

a) identifying amino acid residues located on the surface of the 
15 3D structure of the parent polypeptide in question, 

b) selecting target amino acid residues on the surface of said 3D 
structure of said parent polypeptide to be mutated, 

c) i) substituting or inserting one or more amino acid residues 
selected in step b) with an amino acid residue having a 

20 suitable attachment group, and/ or 

ii) substituting or deleting one or more amino acid residues 
selected in step b) at or close to the functional site(s), 

d) coupling polymeric molecules to the mutated polypeptide. 

The invention also relates to the use of a conjugate of the 
25 invention and the method of the invention for reducing the 
immunogenicity of pharmaceuticals and reducing the allergenicity 
of industrial products. 

Finally the invention relates to compositions comprising a 
conjugate of the invention and further ingredients used in 
30 industrial products or pharmaceuticals. 

BRIEF DESCRIPTION OF THE DRAWING 

Figxire 1 shows the anti-lipase serum antibody levels after 5 
weekly immunizations with i) control ii) unmodified lipase 
35 variant, iii) lipase variant-SPEG. (X: log(serum dilution); Y 
Optical Density (490/620)). 



DETAILED DESCRIPTION OF THE INVENTION 
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It is the object of the present invention to provide improved 
polypeptide-polymer conjugates suitable for industrial and 
pharmaceutical applications • 

Even though polypeptides used for pharmaceutical applications 
5 and industrial application can be quite different the principle of 
the present invention may be tailored to the specific type of 
parent polypeptide (i.e. enzyme, hormone peptides etc.). 

The inventors of the present invention have provided improved 
polypeptide-polymer conjugates with a reduced immune response in 
10 comparison to conjugates prepared from the corresponding parent 
polypeptides. 

The present inventors have found that polypeptides, such as 
enzymes, may be made less immunogenic and/or less allergenic by 
adding one or more attachment groups on the surface of the parent 
15 polypeptide. In addition thereto the inventors have found that a 
higher percentage of maintained residual functional activity may 
be obtained by removing attachment groups at or close to the 
functional site(s). 

In the first aspect the invention relates to an improved 
20 polypeptide-polymer conjugate having 

a) one or more additional polymeric molecules coupled to the 
polypeptide having been modified in a manner to increase the 
number of attachment groups on the surface of the polypeptide in 
comparison to the number of attachment groups available on the 

25 corresponding parent polypeptide, and/ or 

b) one or more fewer polymeric molecules coupled to the 
polypeptide having been modified in a manner to decrease the 
number of attachment groups at or close to the functional site(s) 
of the polypeptide in comparison to the number of attachment 

30 groups available on the corresponding parent polypeptide. 

Whether the attachment groups should be added and/ or removed 
depends on the specific parent polypeptide. 

a^ Addition of A ttachment crroups 
35 There may be a need for further attachment groups on the 
polypeptide if only few attachment groups are available on the 
surface of the parent polypeptide. The addition of one or more 
attachment groups by substituting or inserting one or more amino 
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acid residues on the surface of the parent polypeptide increases 
the number of polymeric molecules which may be attached in 
comparison to the corresponding parent polypeptide. Conjugates 
with an increased n\amber of polymeric molecules attached thereto 
5 are generally seen to have a reduced immune response in comparison 
to the corresponding conjugates having fewer polymeric molecules 
coupled thereto* 

Any available amino acid residues on the sxirface of the 
polypeptide, preferentially not being at or close to the 
10 functional site(s), such as the active site (s) of enzymes, may in 
principle be subject to substitution and/or insertion to provide 
additional attachment groups. 

As will be described further below the location of the 
additional coupled polymeric molecules may be of importance for 
15 the reduction of the immune response and the percentage of 
maintained residual functional activity of the polypeptide itself. 

A conjugate of the invention may typically have from 1 to 25, 
preferentially 1 to 10 or more additional polymeric molecules 
coupled to the surface of the polypeptide in comparison to the 
20 number of polymeric molecules of a conjugate prepared on the basis 
of the corresponding parent polypeptide. 

However, the optimal number of attachment group to be added 
depends (at least partly) on the surface area (i.e. molecular 
weight) of the parent polypeptide to be shielded by the coupled 
25 polymeric molecules, and further off -course also the number of 
already available attachment groups on the parent polypeptide. 

b) Removing Attachment groups 

In the case of enzymes or other polypeptides performing their 

30 function by interaction with a substrate or the like, polymeric 
molecules coupled to the polypeptide might be impeded by the 
interaction between the polypeptide and its substrate or the like, 
if they are coupled at or close to the functional site(s) (i.e. 
active site of enzymes) • This will most probably cause reduced 

35 activity. 

In the case of enzymes having one or more polymeric molecules 
coupled at or close to the active site a substantial loss of 
residual enzymatic activity can be expected. Therefore, according 
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to the invention conjugates may be constructed to maintain a 

higher percentage of residual enzymatic activity in comparison to 

a corresponding conjugates prepared on the basis of the parent 

enzyme in question. This may be done by substituting and/or 
5 deleting attachment groups at or close to the active site, hereby 

increasing the substrate affinity by improving the accessibility 

of the substrate in the catalytic cleft. 

An enzyme-polymer conjugate of the invention may typically have 

from 1 to 25, preferably 1 to 10 fewer polymeric molecules coupled 
10 at or close to the active site in comparison to the number of 

polymeric molecules of a conjugate prepared on the basis of the 

corresponding parent polypeptide. 

As will be explained below "at or close to" the functional 

site(s) means that no polymeric molecule (s) should be coupled 
15 within 5 A, preferably 8 A, especially 10 A of the functional 

site(s) . 

Removal of attachment groups at or close to the fiinctional 
site(s) of the polypeptide may advantageously be combined with 
addition of attachment groups in other parts of the surface of the 
20 polypeptide. 

The total number of attachment groups may this way be 
tinchanged, increased or decreased. However the location (s) of the 
total nvunber of attachment group(s) is(are) improved assessed by 
the reduction of the immune response and/ or percentage of 
25 maintained residual activity. Improved stability may also be 
obtained this way* 

The number of attachment groups 

Generally seen the number of attachment groups should be 
30 balanced to the molecular weight and/or surface area of the 
polypeptide. The more heavy the polypeptide is the more polymeric 
molecules should be coupled to the polypeptide to obtain 
sufficient shielding of the epitope (s) responsible for antibody 
formation. 

35 Therefore, if the parent polypeptide molecule is relatively 
light (e.g. 1 to 35 kDa) it may be advantageous to increase the 
total number of coupled polymeric molecules (outside the 
functional site(s)) to a total between 4 and 20. 
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If the parent polypeptide molecules is heavier, for instance 35 
to 60 kDa, the niimber of coupled polymeric molecules (outside the 
functional site(s)) may advantageously be increased to 7 to 40, 
and so on, 

5 The ratio between the molecular weight (Mw) of the polypeptide 
in question and the number of coupled polymeric molecules 
considered to be suitable by the inventors is listed below in 
Table 1. 

10 Table 1 



Molecular weight of parent 
polypeptide (M^) JtDa 


Number of polymeric 
molecules coupled to the 
polypeptide 


1 to 35 


4-20 


35 to 60 


7-40 


60 to 80 


10-50 


80 to 100 


15-70 


more than 100 


more than 20 



Reduced immune response vs. maintained residual enzymatic activitv 
Especially for enzymes, in compcurison to many other types of 
polypeptides, there is a conflict between reducing the immune 

15 response and maintaining a substantial residual enzymatic activity 
as the activity of enzymes are connected with interaction between 
a substrate and the active site often present as a cleft in the 
enzyme structure. 

Without being limited to any theory it is believed that the 

20 loss of enzymatic activity of enzyme-polymer conjugates might be a 
consequence of impeded access of the substrate to the active site 
in the form of spatial hindrance of the substrate by especially 
bulky and/or heavy polymeric molecules to the catalytic cleft. It 
might also, at least partly, be caused by disadvantageous minor 

25 structural changes of the 3D structure of the enzyme due to the 
stress made by the coupling of the polymeric molecules* 

Maintained residual activitv 

A polypeptide-polymer conjugates of the invention has a 
30 substantially maintained functional activity. 



- wo 98/35026 



PCT/DK98/00046 



10 

A "substantially" maintained functional activity is in the 
context of the present invention defined as an activity which is 
at least between 20% and 30%, preferably between 30% and 40%, more 
preferably between 40% and 60%, better from 60% up to 80%, even 
5 better from 80% up to about 100%, in comparison to the activity of 
the conjugates prepared on the basis of corresponding parent 
polypeptides • 

In the case of polypeptide-polymer conjugates of the 
invention where no polymeric molecules are coupled at or close to 

10 the functional site(s) the residual activity may even be up to 
100% or very close thereto. If attachment group (s) of the parent 
polypeptide is (are) removed from the functional site the activity 
might even be more than 100% in comparison to modified (i.e. 
polymer coupled) parent polypeptide conjugate. 

15 Position of coupled polymeric molecules 

To obtain an optimally reduced immune response (i.e. 
immunogenic and allergenic response) the polymeric molecules 
coupled to the surface of the polypeptide in question should be 
located in a suitable distance from each other. 

20 In a preferred embodiment of the invention the parent 
polypeptide is modified in a manner whereby the polymeric 
molecules are spread broadly over the surface of the polypeptide. 
In the case of the polypeptide in question has enzymatic activity 
it is preferred to have as few as possible, especially none, 

25 polymeric molecules coupled at or close to the area of the active 
site. 

In the present context "spread broadly over the surface of the 
polypeptide" means that the available attachment groups are 
located so that the polymeric molecules shield different parts of 

30 the surface, preferable the whole or close to the whole surface 
area away from the functional site(s), to make sure that 
epitope (s) are shielded and hereby not recognized by the immune 
system or its antibodies. 

The area of antibody-polypeptide interaction typically 

35 covers an area of 500 A^, as described by Sheriff et al. 
(1987), Proc. Natl. Acad. Sci. USA 84, p. 8075-8079. 500 
corresponds to a rectangular box of 25 A x 20 A or a circular 
region of radius 12.6 A. Therefore, to prevent binding of 
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antibodies to the epitope (s) to the polypeptide in question it 
is preferred to have a maximum distance between two attachment 
groups around 10 A. 

Consequently, amino acid residues which are located in excess 
5 of 10 A away from already available attachment groups are 

suitable target residues. If two or more attachment groups on the 
polypeptide are located very close to each other it will in most 
cases result in that only one polymeric molecule will be coupled. 
To ensiure a minimal loss of functional activity it is preferred 

10 not to couple polymeric molecules at or close to the functional 
site(s) . Said distance depends at least partly on the bulkiness of 
the polymeric molecules to be coupled, as impeded access by the 
bulky polymeric molecules to the functional site is undesired. 
Therefore, the more bulky the polymeric molecules are the longer 

15 should the distance from the functional site to the coupled 
polymeric molecules be. 

To maintain a substantial functional activity of the 
polypeptide in question attachment groups located within 5 A, 
preferred 8 A, especially 10 A from such functional site(s) 

20 should be left uncoupled and may therefore advantageously be 
removed or changed by mutation. Functional residues should 
normally not be mutated /removed, even though they potentially 
can be the target for coupling polymeric molecules. In said 
case it may thus be advantageous to chose a coupling chemistry 

25 involving different attachment groups. 

Further, to provide a polypeptide having coupled polymeric 
molecules at (a) known epitope (s) recognizable by the immune 
system or close to said epitope (s) specific mutations at such 
sites are also considered advantageous according to the invention. 

3 0 If the position of the epitope (s) is (are) unknown it is 
advantageous to couple several or many polymeric molecules to the 
polypeptide. 

As also mentioned above it is preferred that said attachment 
groups are spread broadly over the surface. 

35 

The attachment group 

Virtually all ionized groups, such as the amino groups of 
Lysine residues, are located on the surface of the polypeptide 
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molecule (see for instance Thomas E. Creighton, (1993), 
"Proteins", W.H. Freeman and Company, New York). 

Therefore, the number of readily accessible attachment groups 
(e.g. amino groups) on a modified or parent polypeptide equals 
5 generally seen the number of Lysine residues in the primary 
structure of the polypeptide plus the N-terminus amino group. 

The chemistry of coupling polymeric molecules to amino groups 
eire quite simple and well established in the art. Therefore, it is 
preferred to add and/ or remove Lysine residues (i.e. attachment 
10 groups) to/ from the parent polypeptide in question to obtain 
improved conjugates with reduced iramunogenicity and/or 
allergenicity and/ or improved stability and/or high percentage 
maintained functional activity. 

Polymeric molecules may also be coupled to the carboxylic 
15 groups (~COOH) of amino acid residues on the surface of the 
polypeptide. Therefore, if using carboxylic groups (including the 
C-terminal group) as attachment groups addition and/ or removal of 
Aspartate and Glutamate residues may also be a suitable according 
to the invention. 

20 If using other attachment groups, such as -SH groups, they 
may be added and/ or removed analogously. 

Substitution of the amino acid residues is preferred over 
insertion, as the impact on the 3D structure of the polypeptide 
normally will be less pronounced. 

25 Preferred substitutions are conservative substitutions. In the 
case of increasing the number of attachment groups the 
substitution may advantageously be performed at a location having 
a distance of 5 A, preferred 8 k, especially 10 A from the 
functional site(s) (active site for enzymes). 

30 An example of a suitable conservative substitution to obtain 

an additional amino attachment group is a Arginine to Lysine 
substitution. Examples of conservative substitutions to obtain 
additional carboxylic attachment groups are Aspargine to 
Aspartate/Glutamate or Glutamine to Aspartate/Glutamate 

35 substitutions. To remove attachment groups a Lysine residue may be 
substituted with a Arginine and so on. 



The parent polypeptide 
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In the context of the present invention the term "polypeptides" 
includes proteins, peptides and/or enzymes for pharmaceutical or 
industrial applications • Typically the polypeptides in question 
have a molecular weight in the range between about 1 to 100 kba, 
5 often 15 kDa and 100 kDa. 

Pharmaceutical polypeptides 

The term "pharmaceutical polypeptides" is defined as polypep- 
tides, including peptides, such as peptide hormones, proteins 
10 £uid/or enzymes, being physiologically active when introduced into 
the circulatory system of the body of humans and/or animals. 

Pharmaceutical polypeptides are potentially immunogenic as they 
are introduced into the circulatory system. 

Examples of "pharmaceutical polypeptides" contemplated 
15 according to the invention include insulin, ACTH, glucagon, 
somatostatin, somatotropin, thymosin, parathyroid hormone, 
pigmentary hormones, somatomedin, erythropoietin, luteinizing 
hormone, chorionic gonadotropin, hypothalmic releasing factors, 
antidiuretic hormones, thyroid stimulating hormone, relaxin, 
20 interferon, thrombopoietin (TPO) and prolactin. 

Industrial polypeptides 

Polypeptides used for industrial applications often have an 

enzymatic activity. Industrial polypeptides (e.g. enzymes) are (in 
25 contrast to pharmaceutical polypeptides) not intended to be 

introduced into the circulatory system of the body. 

It is not very like that industrial polypeptides, such as 

enzymes used as ingredients in industrial compositions and/or 

products, such as detergents and personal care products, including 
30 cosmetics, come into direct contact with the circulatory system of 

the body of humans or animals, as such enzymes (or products 

comprising such enzymes) are not injected (or the like) into the 

bloodstream. 

Therefore, in the case of the industrial polypeptide the 
35 potential risk is respiratory allergy (i.e. IgE response) as a 
consequence of inhalation to polypeptides through the respiratory 
passage. 

In the context of the present invention "industrial polypep- 
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tides" are defined as polypeptides, including peptides, proteins 
and/ or enzymes, which are not intended to be introduced into the 
circulatory system of the body of hxomans and/ or animals. 

Examples of such polypeptides are polypeptides, especially 
5 enzymes, used in products such as detergents, household article 
products, agrochemicals, personal care products, such as skin care 
products, including cosmetics and toiletries, oral and dermal 
pharmaceuticals, composition use for processing textiles, 
compositions for hard surface cleaning, and compositions used for 
10 manufacturing food and feed etc. 

Enzymatic activity 

Pharmaceutical or industrial polypeptides exhibiting enzymatic 
activity will often belong to one of the following groups of 

15 enzymes including Oxidoreductases (E.C. 1, "Enzyme Nomenclature, 
(1992), Academic Press, Inc.)/ such as laccase and Superoxide 
dismutase (SOD); Transferases, (E.G. 2), such as transglutaminases 
(TGases) ; Hydrolases (E.G. 3), including proteases, especially 
subtilisins, and lipolytic enzymes; Isomerases (E.G. 5) , such as 

20 Protein disulfide Isomerases (PDI) • 

Hydrolases 

Proteolytic enzymes 

Gontemplated proteolytic enzymes include proteases selected 
25 from the group of Aspartic proteases, such pepsins. Cysteine 

proteases, such as Papain, Serine proteases, such as subtilisins, 

or metallo proteases, such as Neutrase®. 

Specific examples of parent proteases include PD498 (WO 

93/24623 and SEQ ID NO. 2), Savinase® (von der Osten et al., 
30 (1993), Journal of Biotechnology, 28, p. 55+, SEQ ID NO 3), 

Proteinase K (Gunkel et al., (1989), Eur. J. Biochem, 179, p. 185- 

194), Proteinase R (Samal et al, (1990), Mol. Microbiol, 4, p. 

1789-1792), Proteinase T (Samal et al., (1989), Gene, 85, p. 329- 

333), Subtilisin DY (Betzel et al. (1993), Arch. Biophys, 302, no. 
35 2, p. 499-502), Lion Y (JP 04197182-A), Rennilase® (Available from 

Novo Nordisk A/S) , JA16 (WO 92/17576) , Alcalase® (a natural 

subtilisin Carlberg variant) (von der Osten et al., (1993), 

Journal of Biotechnology, 28, p. 55+). 
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Lipolytic enzymes 

Conteinplated lipolytic enzymes include Humicola lanuginosa 

lipases, e.g. the one described in EP 258 068 and EP 305 216 (See 
5 SEQ ID NO 6 below) , Humicola insolens, a Rhizomucor miehei lipase, 

e.g. as described in EP 238 023, Absidia sp. lipolytic enzymes (WO 

96/13578), a Candida lipase, such as a C. antarctica lipase, e.g. 

the C. antarctica lipase A or B described in EP 214 761, a 

Pseudomonas lipase such as a P. alcaligenes and P. 
10 pseudoalcaligenes lipase, e.g. as described in EP 218 272, a P. 

cepacia lipase, e.g. as described in EP 331 376, a Pseudomonas sp. 

lipase as disclosed in WO 95/14783, a Bacillus lipase, e.g. a B. 

subtilis lipase (Dartois et al., (1993) Biochemica et Biophysica 

acta 1131, 253-260), a B. stearothermophilus lipase (JP 64/744992) 
15 and a B. pumilus lipase (WO 91/16422). Other types of lipolytic 

include cutinases, e.g. derived from Pseudomonas mendocina as 

described in WO 88/09367, or a cutinase derived from Fusariujn 

solani pisi (e.g. described in WO 90/09446). 

20 Oxidoreductases 

Laccases 

Contemplated laccases include Polyporus pinisitus laccase (WO 
96/00290) , Myceliophthora laccase (WO 95/33836) , Schytalidium 
laccase (WO 95/338337), and Pyricularia oryzae laccase ('Available 
25 from Sigma; . 

Peroxidase 

Contemplated peroxidases include B. pumilus peroxidases (WO 
91/05858), Myxococcaceae peroxidase (WO 95/11964), Coprinus 
30 cinereus (WO 95/10602) and Arthrojnyces ramosus peroxidase 
(Kunishima et al. (1994), J. Mol. Biol. 235, p. 331-344). 

Transferases 

Trcmsgluteuninases 

35 Suitable transferases include any transglutaminases disclosed 
in WO 96/06931 (Novo Nordisk A/S) and WO 96/22366 (Novo Nordisk 
A/S) . 
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Isomerases 

Protein Disulfide Isomerase 

Without being limited thereto suitable protein disulfide 
isomerases include PDIs described in WO 95/01425 (Novo Nordisk 
5 A/S) . 

The polymeric molecule 

The polymeric molecules coupled to the polypeptide may be any 
suitable polymeric molecule, including natural and synthetic homo- 

10 polymers, such as polyols (i.e. poly-OH) , polyamines (i.e. poly- 
NH2) and polycarboxyl acids (i.e. poly-COOH) , and further hetero- 
polymers i.e. polymers comprising one or more different coupling 
groups e.g. a hydroxy 1 group and amine groups. 

Examples of suitable polymeric molecules include polymeric 

15 molecules selected from the group comprising polyalkylene oxides 
(PAO) , such as polyalkylene glycols (PAG) , including polyethylene 
glycols (PEG) , methoxypolyethylene glycols (mPEG) and polypropylen 
glycols, PEG-glycidyl ethers (Epox-PEG) , PEG-oxycarbonylimidazole 
(GDI -PEG) , Branced PEGs, poly-vinyl alcohol (PVA) , poly- 

2 0 carboxylates , poly- ( viny Ipyrolidone) , poly-D , L-amino acids , 
polyethylene-co-maleic acid anhydride, polystyrene-co-malic acid 
anhydrid, dextrans including carboxymethy 1-dextrans , heparin, 
homologous albumin, celluloses, including methylcellulose, 
carboxymethylcellulose , ethylcellulose, hydroxyethylcellulose 

25 carboxyethylcellulose and hydroxypropylcellulose, hydrolysates of 
chitosan, starches such as hydroxyethyl-straches and hydroxy 
propyl-starches, glycogen, agaroses and derivates thereof, guar 
gum, pullulan, inulin, xanthan g\im, carrageenin, pectin, alginic 
acid hydrolysates and bio-polymers. 

30 Preferred polymeric molecules are non-toxic polymeric molecules 
such as (m) polyethylene glycol ((m)PEG) which further requires a 
relatively simple chemistry for its covalently coupling to 
attachment groups on the enzyme's surface. 

Generally seen polyalkylene oxides (PAO) , such as polyethylene 

35 oxides, such as PEG and especially mPEG, are the preferred 
polymeric molecules, as these polymeric molecules, in comparison 
to polysaccharides such as dextran, pullulan and the like, have 
few reactive groups capable of cross-linking. 
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Even though all of the above mentioned polymeric molecules may 
be used according to the invention the methoxypoly ethylene glycols 
(mPEG) may advantageously be used. This arise from the fact that 
methoxy ethylene glycols have only one reactive end capable of 
5 conjugating with the enzyme. Consequently, the risk of cross- 
linking is less pronounced. Further, it makes the product more 
homogeneous and the reaction of the polymeric molecules with the 
enzyme easier to control. 

10 Preparation of enzvme variants 

Enzyme variants to be conjugated may be constructed by any 
suitable method. A number of methods are well established in 
the art. For instance enzyme variants according to the 
invention may be generated using the same materials and methods 

15 described in e.g. WO 89/06279 (Novo Nordisk A/S) , EP 130,756 
(Genentech) , EP 479,870 (Novo Nordisk A/S) , EP 214,435 
(Henkel) , WO 87/04461 (Amgen) , WO 87/05050 (Genex) , EP appli- 
cation no. 87303761 (Genentech) , EP 260,105 (Genencor) , WO 
88/06624 (Gist-Brocades NV) , WO 88/07578 (Genentech) , WO 

20 88/08028 (Genex) , WO 88/08033 (Amgen), WO 88/08164 (Genex) , 
Thomas et al. (1985) Nature, 318 375-376; Thomas et al. (1987) 
J. Mol. Biol., 193, 803-813; Russel and Fersht (1987) Nature 
328 496-500. 

25 Generation of site directed mutations 

Prior to mutagenesis the gene encoding the polypeptide of 
interest must be cloned in a suitable vector- Methods for 
generating mutations in specific sites is described below. 

Once the polypeptide encoding gene has been cloned, and 

3 0 desirable sites for mutation identified and the residue to 
substitute for the original ones have been decided, these 
mutationis can be introduced using synthetic oligonucleotides. 
These oligonucleotides contain nucleotide sequences flanking the 
desired mutation sites; mutant nucleotides are inserted during 

35 oligo-nucleotide synthesis. In a preferred method, Site-directed 
mutagenesis is carried out by SOE-PCR mutagenesis technique 
described by Kammann et al. (1989) Nucleic Acids Research 17(13), 
5404, and by Sarkar G, and Soiamer, S.S. (1990); Biotechniques 8, 
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404-407. 

Activation of polymers 

If the polymeric molecules to be conjugated with the 
5 polypeptide in question are not active it must be activated by the 
use of a suitable technique. It is also contemplated according to 
the invention to couple the polymeric molecules to the polypeptide 
through a linker. Suitable linkers are well-known to this skilled 
person. 

10 Methods and chemistry for activation of polymeric molecules 
as well as for conjugation of polypeptides are intensively 
described in the literature. Commonly used methods for activation 
of insoluble polymers include activation of functional groups with 
cyanogen bromide , per iodate , glutaraldehyde , biepoxides , 

15 epichlorohydrin, divinylsulf one, carbodiimide, sulfonyl halides, 
trichlorotriazine etc. (see R.F. Taylor, (1991), "Protein 
immobilisation. Fundamental and applications", Marcel Dekker, 
N.Y.; S.S. Wong, (1992), "Chemistry of Protein Conjugation and 
Crosslinking", CRC Press, Boca Raton; G.T. Hermanson et al., 

20 (1993), "Immobilized Affinity Ligand Techniques", Academic Press, 
N.Y.). Some of the methods concern activation of insoluble 
polymers but are also applicable to activation of soluble polymers 
e.g. periodate, trichlorotriazine, sulf onylhalides, 

divinylsulfone, carbodiimide etc. The functional groups being 

25 amino, hydroxyl, thiol, carboxyl, aldehyde or sulfydryl on the 
polymer and the chosen attachment group on the protein must be 
considered in choosing the activation and conjugation chemistry 
which normally consist of i) activation of polymer, ii) 
conjugation, and iii) blocking of residual active groups. 

30 In the following a number of suitable polymer activation 

methods will be described shortly. However, it is to be understood 
that also other methods may be used. 

Coupling polymeric molecules to the free acid groups of poly- 
peptides may be performed with the aid of diimide and for example 

35 amino-PEG or hydrazino-PEG (Pollak et al., (1976), J. Amr. Chem. 
Soc, 98, 289-291) or diazoacetate/ amide (Wong et al., (1992), 
"Chemistry of Protein Conjugation and Crosslinking", CRC Press). 
Coupling polymeric molecules to hydroxy groups are generally 
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very difficult as it must be performed in water. Usually 
hydrolysis predominates over reaction with hydroxyl groups. 

coupling polymeric molecules to free sulfhydryl groups can be 
reached with special groups like maleimido or the ortho-pyridyl 
5 disulfide. Also vinylsulfone (US patent no. 5,414,135, (1995), 
Snow et al.) has a preference for sulfhydryl groups but is not as 
selective as the other mentioned. 

Accessible Arginine residues in the polypeptide chain may be 
targeted by groups comprising two vicinal carbonyl groups. 

10 Techniques involving coupling electrophilically activated 
PEGS to the amino groups of Lysines may also be useful. Many of 
the usual leaving groups for alcohols give rise to an amine 
linkage. For instance, alkyl sulfonates, such as tresylates 
(Nilsson et al., (1984), Methods in Enzymology vol. 104, Jacoby, 

15 W. B., Ed., Academic Press: Orlando, p. 56-66; Niisson et al., 
(1987), Methods in Enzymology vol. 135; Mosbach, K. , Ed.; Academic 
Press: Orlando, pp. 65-79; Scouten et al., (1987), Methods in 
Enzymology vol. 135, Mosbach, K. , Ed., Academic Press: Orlando, 
1987; pp 79-84; Crossland et al., (1971), J. Amr. Chem. Soc. 1971, 

20 93, pp. 4217-4219), mesylates (Harris, (1985), suora ; Harris et 
al., (1984), J. Polym. Sci. Polym. Chem. Ed. 22, pp 341-352), aryl 
sulfonates like tosylates, and para-nitrobenzene sulfonates can be 
used. 

Organic sulfonyl chlorides, e.g. Tresyl chloride, effectively 
25 converts hydroxy groups in a number of polymers, e.g. PEG, into 
good leaving groups (sulfonates) that, when reacted with nucleo- 
philes like amino groups in polypeptides allow stable linkages to 
be formed between polymer and polypeptide. In addition to high 
conjugation yields, the reaction conditions are in general mild 
30 (neutral or slightly alkaline pH, to avoid denaturation and little 
or no disruption of activity) , and satisfy the non-destructive re- 
quirements to the polypeptide. 

Tosylate is more reactive than the mesylate but also more un- 
stable decomposing into PEG, dioxane, and sulfonic acid (Zalipsky, 
35 (1995), Bioconjugate Chem., 6, 150-165). Epoxides may also been 
used for creating amine bonds but are much less reactive than the 
above mentioned groups. 

Converting PEG into a chloroformate with phosgene gives rise 
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to carbamate linkages to Lysines. This theme can be played in many 
variants substituting the chlorine with N-hydroxy succinimide (US 
patent no. 5,122,614, (1992); Zalipsky et al. , (1992), Biotechnol. 
Appl. Biochem., 15, p. 100-114; Monfardini et al • , (1995), Biocon- 
5 jugate Chem. , 6, 62-69, with imidazole (Allen et al*, (1991), 
Carbohydr. Res,, 213, pp 309-319), with para-nitrophenol, DMAP (EP 
632 082 Al, (1993), Looze, Y.) etc. The derivatives are usually 
made by reacting the chloroformate with the desired leaving group. 
All these groups give rise to carbamate linkages to the peptide. 

10 Furthermore, isocyanates and isothiocyanates may be employed 
yielding ureas and thioureas, respectively. 

Amides may be obtained from PEG acids using the same leaving 
groups as mentioned above and cyclic imid thrones (US patent no. 
5,349,001, (1994), Greenwald et al.). The reactivity of these com- 

15 pounds are very high but may make the hydrolysis to fast. 

PEG succinate made from reaction with succinic anhydride can 
also be used. The hereby comprised ester group make the conjugate 
much more susceptible to hydrolysis (US patent no. 5,122,614, 
(1992), Zalipsky). This group may be activated with N-hydroxy sue- 

20 cinimide. 

Furthermore^ a special linker can be introduced. The oldest 
being cyanuric chloride (Abuchowski et al., (1977), J. Biol. 
Chem., 252, 3578-3581; US patent no. 4,179,337, (1979), Davis et 
al.; Shafer et al., (1986), J. Polym. Sci. Polym. Chem. Ed., 24, 
25 375-378. 

Coupling of PEG to an aromatic amine followed by diazotation 
yields a very reactive diazonium salt which in situ can be reacted 
with a peptide. An amide linkage may also be obtained by reacting 
an azlactone derivative of PEG (US patent no. 5,321,095, (1994), 
30 Greenwald, R. B.) thus introducing an additional amide linkage. 

As some peptides do not comprise many Lysines it may be 
advantageous to attach more than one PEG to the same Lysine. This 
can be done e.g. by the use of l,3-diamino-2-propanol. 

PEGS may also be attached to the amino-groups of the enzyme 
35 with carbamate linkages (WO 95/11924, Greenwald et al.). Lysine 
residues may also be used as the backbone. 

The coupling technique used in the examples is the N- 
succinimidyl carbonate conjugation technique descried in WO 
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90/13590 (Enzon) . 

Method for preparing improved conjugates 

It is also an object of the invention to provide a method for 
5 preparing improved polypeptide-polymer conjugates comprising the 
steps of: 

a) identifying amino acid residues located on the surface of the 
3D structure of the parent polypeptide in question, 

b) selecting target amino acid residues on the surface of said 3D 
10 structure of said parent polypeptide to be mutated, 

c) i) substituting or inserting one or more amino acid residues 
selected in step b) with an amino acid residue having a suitable 
attachment group, and/ or 

ii) substituting or deleting one or more amino acid residues 
15 selected in step b) at or close to the functional site(s) , 

d) coupling polymeric molecules to the mutated polypeptide. 

Step a) Identifying amino acid residues located on the sur face of 
the parent polvpeptide 

20 

3-dimensional structure f 3D-structure) 

To perform the method of the invention a 3-dimensional 

structure of the parent polypeptide in question is required. 

This structure may for example be an X-ray structure, an NMR 
25 structure or a model-built structure. The Brookhaven Databank 

is a source of X-ray- and NMR-structures . 

A model-built structure may be produced by the person 

skilled in the art if one or more 3D-structure(s) exist (s) of 

homologous polypeptide (s) sharing at least 30% sequence 
30 identity with the polypeptide in question. Several software 

packages exist which may be employed to construct a model 

structure. One example is the Homology 95.0 package from 

Biosyra. 

Typical actions required for the construction of a model 
35 structure are: alignment of homologous sequences for which 3D- 
structures exist, definition of Structurally Conserved Regions 
(SCRs) , assignment of coordinates to SCRs, search for 
structural fragments/ loops in structure databases to replace 
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Variable Regions, assignment of coordinates to these regions, 
and structural refinement by energy minimization. Regions 
containing large inserts (^3 residues) relative to the known 
3D-structures are known to be quite difficult to model, and 
5 structural predictions must be considered with care. 

Having obtained the 3D-structure of the polypeptide in 
question, or a model of the structure based on homology to 
known structures, this structure serves as an essential 
prerequisite for the fulfillment of the method described below. 

10 

Step b) Selection of target amino acid residues for mutation 
Target amino acid residues to be mutated are according to 
the invention selected in order to obtain additional or fewer 
attachment groups, such as free amino groups (-NH2) or free 
15 cairboxylic acid groups (-COOH), on the surface of the 

polypeptide and/or to obtain a more complete and broadly spread 
shielding of the epitope (s) on the surface of the polypeptide. 

Conservative substitution 
20 It is preferred to make conservative substitutions in the 

polypeptide, as conservative substitutions secure that the 

impact of the mutation on the polypeptide structure is limited. 
In the case of providing additional amino groups this may be 

done by substitution of Arginine to Lysine, both residues being 
25 positively charged, but only the Lysine having a free amino 

group suitable as an attachment groups. 

In the case of providing additional carboxylic acid groups 

the conservative substitution may for instance be an Aspargine 

to Aspartic acid or Glutamine to Glutamic acid substitution. 
30 These residues resemble each other in size and shape, except 

from the carboxylic groups being present on the acidic 

residues. 

In the case of providing fewer attachment groups, e.gr. at or 
close to the active site, a Lysine may be substituted with a 
35 Arginine, and so on. 

Which amino acids to substitute depends in principle on the 
coupling chemistry to be applied. 
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Non-conservative substitution 

The mutation may also be on target amino acid residues which 
are less/non-conservative. Such mutation is suitable for 
obtaining a more complete and broadly spread shielding of the 
5 polypeptide surface than can be obtained by the conservative 
substitutions* 

The method of the invention is first described in general 
terms, and subsequently using specific examples. 

Note the use of the following terms: 
10 Attachment_residue: residue(s) which can bind polymeric 
molecules, e.g. Lysines (amino group) or Aspartic/Glutamic 
acids (carboxylic groups) . N- or C-terminal amino/ carboxy lie 
groups are to be included where relevant. 

Mutation_residue: residue (s) which is to be mutated, e.g. 
15 Arginine or Aspargine/Glutamine. 

Essential_catalytic_residues: residues which are known to be 
essential for catalytic function, e.g. the catalytic triad in 
Serine proteases. 

Solvent_exposed_residues: These are defined as residues which 
20 are at least 5% exposed according to the BIOSYM/ INSIGHT 

algorithm found in the module Homology 95.0. The sequence of 
commands are as follows: 

Homology=>ProStat=>Access_Surf=>Solv_Radius 1.4; Heavy atoms 
only; Radii source VdW; Output: Fractional Area; Polarity 
25 source: Default. The file f ilename_area. tab is produced. Note: 
For this program to function properly all water molecules must 
first be removed from the structure. 
It looks for example like: 
# PD498FINALM0DEL 



30 


# residue 


area 




TRP_1 


136.275711 




SER_2 


88.188095 




PR0_3 


15.458788 




ASN_4 


95.322319 


35 


ASP_5 


4.903404 




PR0_6 


68.096909 




TYR_7 


93.333252 




TYR^S 


31.791576 
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SER_9 95 •983139 

• . continued 

1. Identification of residues which are more than 10 A away 

5 from the closest attachment_residue, and which are located at 
least 8 A away from essential_catalytic_residues. This residue 
subset is called REST, and is the primary region for 
conservative mutation_residue to attachment_residue 
substitutions . 

10 

2. Identification of residues which are located in a 0-5 A 
shell around subset REST, but at least 8 A away from 
essential_catalytic__residues. This residue subset is called 
SUB5B. This is a secondary region for conservative 

15 mutation_residue to attachment_residue substitutions, as a 

ligand bound to an attachment_residue in SUB5B will extend into 
the REST region and potentially prevent epitope recognition. 

3. Identification of solvent_exposed routation_residues in REST 
20 and SUB5B as potential mutation sites for introduction of 

attachment_r esidues . 

4. Use BIOSYM/INSIGHT's Biopolymer module and replace residues 
identified under action 3. 

25 

5. Repeat 1-2 above producing the subset RESTx- This subset 
includes residues which are more than 10 A away from the 
nearest attachment_residue, and which are located at least 8 A 
away from essential catalytic residues. 

30 

6. Identify solvent_exposed_residues in RESTx. These are 
potential sites for less/non-conservative mutations to 
introduce atttachment residues. 



Step c) Substituting, inserting or deleting amino acid residues 

The mutation (s) performed in step c) may be performed by 
standard techniques well known in the art, such as site-directed 
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mutagenesis (see, e.g., Sambrook et al. (1989), Sambrook et al., 
Molecular Cloning. A Laboratory Manual, Cold Spring Harbor, NY. 

A general description of nucleotide substitution can be found 
in e.g. Ford et al., 1991, Protein Expression and Purification 2, 
5 p. 95-107. 

Step d) Coupling polvmeric molecul es to the modified parent enzyme 
Polypeptide-polymer conjugates of the invention may be 
prepared by any coupling method known in the art including the 
10 above mentioned techniques. 

Coupling of polymeric molecules to the polvpeptide in question 

If the polymeric molecules to be conjugated with the 
polypeptide are not active it must be activated by the use of a 

15 suitable method. The polymeric molecules may be coupled to the 
polypeptide through a linker. Suitable linkers are well known to 
the skilled person. 

Methods and chemistry for activation of polymeric molecules as 
well as for conjugation of polypeptides are intensively described 

20 in the literature. Commonly used methods for activation of 
insoluble polymers include activation of functional groups with 
cyanogen bromide, periodate, glutaraldehyde, biepoxides, 
epichlorohydrin, divinylsulfone, carbodiimide, sulfonyl halides, 
trichlorotriazine etc. (see R.F. Taylor, (1991), "Protein 

25 immobilisation. Fundamental and applications". Marcel Dekker, 
N.Y.; S.S. Wong, (1992), "Chemistry of Protein Conjugation and 
Crosslinking" , CRC Press, Boca Raton; G.T. Hermanson et al., 
(1993), "Immobilized Affinity Ligand Techniques", Academic Press, 
N.Y.). Some of the methods concern activation of insoluble 

30 polymers but are also applicable to activation of soluble polymers 
e.g. periodate, trichlorotriazine, sulfonylhalides, 

divinylsulfone, carbodiimide etc. The functional groups being 
amino, hydroxyl, thiol, carboxyl, aldehyde or sulfydryl on the 
polymer and the chosen attachment group on the protein must be 

35 considered in choosing the activation and conjugation chemistry 
which normally consist of i) activation of polymer, ii) 
conjugation, and iii) blocking of residual active groups. 

In the following a number of suitable polymer activation 
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methods will be described shortly. However, it is to be understood 
that also other methods may be used. 

Coupling polymeric molecules to the free acid groups of enzymes 
can be performed with the aid of diimide and for example amino-PEG 
5 or hydrazino-PEG (Pollak et al., (1976), J. Amr. Chem. Soc, 98, 
289-291) or diazoacetate/amide (Wong et al., (1992), "Chemistry of 
Protein Conjugation and Crosslinking", CRC Press). 

Coupling polymeric molecules to hydroxy groups are generally 
very difficult as it must be performed in water. Usually 

10 hydrolysis predominates over reaction with hydroxy 1 groups. 

Coupling polymeric molecules to free sulfhydryl groups can be 
reached with special groups like maleimido or the ortho-pyridyl 
disulfide. Also vinylsulfone (US patent no. 5,414,135, (1995), 
Snow et al.) has a preference for sulfhydryl groups but is not as 

15 selective as the other mentioned. 

Accessible Arginine residues in the polypeptide chain may be 
targeted by groups comprising two vicinal carbonyl groups. 

Techniques involving coupling electrophilically activated PEGs 
to the amino groups of Lysines are also be useful. Many of the 

20 usual leaving groups for alcohols give rise to an amine linkage. 
For instance, alkyl sulfonates, such as tresylates (Nilsson et 
al., (1984), Methods in Enzymology vol. 104, Jacoby, W. B., Ed., 
Academic Press: Orlando, p. 56-66; Nilsson et al., (1987), Methods 
in Enzymology vol. 135; Mosbach, K. , Ed.; Academic Press: Orlando, 

25 pp. 65-79; Scouten et al., (1987), Methods in Enzymology vol. 135, 
Mosbach, K., Ed., Academic Press: Orlando, 1987; pp 79-84; 
Crossland et al., (1971), J. Amr. Chem. Soc. 1971, 93, pp. 4217-4- 
219), mesylates (Harris, (1985), supra ; Harris et al., (1984), J. 
Polym. Sci. Polym. Chem. Ed. 22, pp. 341-352), aryl sulfonates 

30 like tosylates, and para-nitrobenzene sulfonates can be used. 

Organic sulfonyl chlorides, e.g. Tresyl chloride, effectively 
converts hydroxy groups in a number of polymers, e.g. PEG, into 
good leaving groups (sulfonates) that, when reacted with 
nucleophiles like amino groups in polypeptides allow stable 

35 linkages to be formed between polymer and polypeptide. In addition 
to high conjugation yields, the reaction conditions are in general 
mild (neutral or slightly alkaline pH, to avoid denaturation and 
little or no disruption of activity) , and satisfy the non- 
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destructive requirements to the polypeptide - 

Tosylate is more reactive than the mesylate but also more 
unstable decomposing into PEG, dioxane, and sulfonic acid 
(Zalipsky, (1995), Bioconjugate Chem. , 6, 150^165). Epoxides may 
5 also been used for creating amine bonds but are much less reactive 
than the above mentioned groups. 

Converting PEG into a chloroformate with phosgene gives rise to 
carbamate linkages to Lysines. This theme can be played in many 
variants substituting the chlorine with N-hydroxy succinimide (US 
10 patent no. 5,122,614, (1992); Zalipsky et al., (1992), Biotechnol. 
Appl. Biochem., 15, p. 100-114; Monfardini et al., (1995), 
Bioconjugate Chem. , 6, 62-69, with imidazole (Allen et al., 

(1991) , Carbohydr. Res., 213, pp 309-319), with para-nitrophenol, 
DMAP (EP 632 082 Al, (1993), Looze, Y.) etc. The derivatives are 

15 usually made by reacting the chloroformate with the desired 
leaving group. All these groups give rise to carbamate linkages to 
the peptide. 

Furthermore, isocyanates and isothiocyanates may be employed 
yielding ureas and thioureas, respectively. 
20 Amides may be obtained from PEG acids using the same leaving 
groups as mentioned above and cyclic imid thrones (US patent no. 
5 , 349 , 001 , ( 1994 ) , Greenwald et al . ) . The reactivity of these 
compounds are very high but may make the hydrolysis to fast. 

PEG succinate made from reaction with succinic anhydride can 
25 also be used. The hereby comprised ester group make the conjugate 
much more susceptible to hydrolysis (US patent no. 5,122,614, 

(1992) , Zalipsky). This group may be activated with N-hydroxy 
succinimide . 

Furthermore, a special linker can be introduced. The oldest 
30 being cyanuric chloride (Abuchowski et al., (1977), J. Biol. 
Chem., 252, 3578-3581; US patent no. 4,179,337, (1979), Davis et 
al.; Shafer et al., (1986), J. Polym. Sci. Polym. Chem. Ed., 24, 
375-378. 

Coupling of PEG to an aromatic amine followed by diazotation 
35 yields a very reactive diazoniim salt which in situ can be reacted 
with a peptide. An amide linkage may also be obtained by reacting 
an azlactone derivative of PEG (US patent no. 5,321,095, (1994), 
Greenwald, R. B.) thus introducing an additional amide linkage- 
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As some peptides do not comprise many Lysines it may be advan- 
tageous to attach more than one PEG to the same Lysine. This can 
be done e.g. by the use of 1, 3-diamino-2-propanol. 

PEGS may also be attached to the amino-groups of the enzyme 
5 with carbamate linkages (WO 95/11924, Greenwald et al.)- Lysine 
residues may also be used as the backbone. 

Addition of attachment groups 

Specific examples of PD498 variant-SPEG coniuaates 
10 A specific example of a protease is the parent PD498 (WO 
93/24623 and SEQ ID NO. 2). The parent PD498 has a molecular 
weight of 29 kDa. 

Lysine and Arginine residues are located as follows: 



Distance from the 


Arginine 


Lysine 


active site 






0-5 A 


1 




5-10 A 






10-15 A 


5 . 


6 


15-20 A 


2 


3 


20-25 A 


1 


3 


total 


9 


12 



15 The inventors examined which parent PD498 sites on the surface 
may be suitable for introducing additional attachment groups. 

A. Suitable conservative Arginine to Lysine substitutions in 
parent PD498 may be any of R51K, R62K, R121K, R169K, R250K, R28K, 
R190K, 

20 B. Suitable non-conservative substitutions in parent PD498 may 
be any of P6K, Y7K, S9K, AlOK, YllK, Q12K, D43K, Y44K, N45K, 
N65K, G87K, I8BK, N209K, A211K, N216K, N217K, G218K, y219K, 
S220K, Y221K, G262K. 

As there is no Lysine residues at or close to the active site 

25 there is no need for removing any attachment group. 

PD498 variant-SPEG conjugates may be prepared using any of the 
above mentioned PD498 variants as the starting material by any 
conjugation technique known in the art for coupling polymeric 
molecules to amino groups on the enzyme. A specific example is 

30 described below. 
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Removal of attachment groups 

Specific examples of BPN" variant-SPEG conjugates 

A specific example of a protease having an attachment group in 
5 the active site is BPN' which has 11 attachment groups (plus an N- 
terminal amino group): BPN' has a molecular weight of 28 kDa. 

Lysine and Arginine residues are located as follows: 



Distance from 


Arginine 


Lysine 


the active site 






0-5 A 




1 


5-10 A 






10-15 A 


1 


4 


15-20 A 


1 


4 


20-25 A 




2 


total 


2 


11 



10 The Lysine residue located within 0-5 A of the active site can 

according to the invention advantageously be removed. Specifically 

this may be done by a K94R substitution. 

BPN' variant-SPEG conjugates may be prepared using the above 

mentioned BPN" variant as the starting material by any conjugation 
15 technique known in the art for coupling polymeric molecules to 

amino groups on the enzyme. 

Addition and removal of attachment groups 

Specific example of Savinase®-SPEG conjugates 
20 As described in Example 2 parent Savinase® (von der Osten et 

al., (1993), Journal of Biotechnology, 28, p. 55+ and SEQ ID NO. 

3) may according to the invention have added a nuiaber of amino 

attachment groups to the surface and removed an amino attachment 

group close to the active site. 
25 Any of the following substitutions in the parent Savinase® 

are sites for mutagenesis: RIOK, R19K, R45K, R145K, R170K, 

R186K and R247K. 

The substitution K94R are identified as a mutation suitable 

for preventing attachment of polymers close to active site. 
30 Savinase® variant-SPEG conjugates may be prepared using any of 
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the above mentioned Savinase® variants as the starting material by 
any conjugation technique known in the art for coupling polymeric 
molecules to amino groups on the enzyme, 

5 Addition of attachment groups 

A specific examples of Humicola lanuginosa lipase variants-SPEG 
conjugates 

Specific examples of lipase variants with reduced 
immunogenicity using the parent Huminocal lanuginosa DSM 4109 
10 lipase (see SEQ ID No 6) as the backbone for substitutions are 
listed below. 

The parent unmodified Humicola lanuginosa lipase has 8 
attachment groups including the N- terminal NH2 group and a 
molecular weight of about 29 kDa. 
15 A. Suitable conservative Arginine to Lysine substitutions in the 
parent lipase may be any of R133K, R139K, R160K, R179K, R209K, 
R118K and R125K* 

Suitable non-conservative substitutions in the parent lipase 
may be any of: 

20 A18K,G31K,T32K,N33K,G38K,A40K,D48K,T50K,E56K,D57K,S58K,G59K, 
V60K,G61K,D62K,T64K,L78K,N88K,G91K,N92K,L93K,S105K,G106K, 
V120K,P136K,G225K,L227K,V228K,P229K,P250K,F262K. 

Further suitable non-conservative substitution in the Humicola 
lanuginosa lipase include: E87K or D254K. 

25 Lipase variant-SPEG conjugates may be prepared using any of the 
above mentioned lipase variants as the starting material by any 
conjugation technique known in the art for coupling polymeric 
molecules to amino groups on the enzyme. A specific exeimple is 
described below. 

30 In Example .12 below is it shown that a conjugate of the 
Humicola lanuginosa lipase variant with a E87K+D254K substitutions 
coupled to S-PEG 15,000 has reduced immunogenic response in Balb/C 
iaice in comparison to the corresponding parent unmodified enzyme. 

35 Immunogenicity and Alleraenicitv 

"Immunogenicity" is a wider term than "antigenicity" and 
"allergenicity", and expresses the immune system's response to the 
presence of foreign substances. Said foreign substances are called 
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iinmunogens, antigens and allergens depending of the type of immune 
response the elicit. 

An "immunogen" may be defined as a substance which, when intro- 
duced into circulatory system of animals and humans, is capable of 
5 stimulating an immunologic response resulting in formation of 
immunoglobulin. 

The term "antigen" refers to substances which by themselves are 
capable of generating antibodies when recognized as a non-self 
molecule. 

10 Further, an "allergen" may be defined as an antigen which may 
give rise to allergic sensitization or an allergic response by IgE 
antibodies (in hximans, and molecules with comparable effects in 
animals) • 

15 Assessment of iimnunoaencitv 

Assessment of the immunogenicity may be made by injecting 
animal subcutaneously to enter the immunogen into the circulation 
system and comparing the response with the response of the 
corresponding parent polypeptide. 

20 The "circulatory system" of the body of h\unans and animals 
means, in the context of the present invention, the system which 
mainly consists of the heart and blood vessels. The heart delivers 
the necessary energy for maintaining blood circulation in the 
vascular system. The circulation system functions as the 

25 organism's transportation system, when the blood transports O2, 
nutritious matter, hormones, and other substances of importance 
for the cell regulation into the tissue. Further the blood removes 
CO2 from the tissue to the lungs and residual substances to e.g. 
the kidneys. Furthermore, the blood is of importance for the 

30 temperature regulation and the defence mechanisms of the body, 
which include the immune system. 

A ntamber of in vitro animal models exist for assessment of the 
immunogenic potential of polypeptides. Some of these models give a 
suitable basis for hazard assessment in man. Suitable models 

35 include a mice model. 

This model seek to identify the immunogenic response in the 
form of the IgG response in Balb/C mice being injected 
subcutaneously with modified and unmodified polypeptides. 



- wo 98/35026 



32 



PCT/DK98/00046 



Also other animal models can be used for assessment of the 
immunogenic potential. 

A polypeptide having "reduced immunogenicity" according to the 
invention indicates that the amount of produced antibodies, e.g. 
5 immunoglobulin in humans, and molecules with comparable effects in 
specific animals, which can lead to an immune response, is 
significantly decreased, when introduced into the circulatory 
system, in comparison to the corresponding parent polypeptide. 

For Balb/C mice the IgG response gives a good indication of the 
10 immunigenic potential of polypeptides. 

Assessment of alleraenicitv 

Assessment of allergenicity may be made by inhalation tests, 
comparing the effect of intratracheally (into the trachea) 

15 administrated parent enzymes with the corresponding modified 
enzymes according to the invention. 

A number of in vivo animal models exist for assessment of the 
allegenicity of enzymes. Some of these models give a suitable 
basis for hazard assessment in man. Suitable models include a 

20 guinea pig model and a mouse model. These models seek to identify 
respiratory allergens as a function of elicitation reactions 
induced in previously sensitised animals. According to these 
models the alleged allergens are introduced intratracheally into 
the amimals. 

25 A suitable strain of guinea pigs, the Dunkin Hartley strain, do 
not as humans, produce IgE antibodies in connection with the 
allergic response. However, they produce another type of antibody 
the IgGlA and IgGlB (see e.g. Prent0, ATLA, 19, p. 8-14, 1991), 
which are responsible for their allergenic response to inhaled 

30 polypeptides including enzymes. Therefore, when using the Dunkin 
Hartley animal model, the relative amount of IgGlA and igGlB is a 
measure of the allergenicity level. 

The Balb/C mice strain is suitable for intratracheal exposure. 
Balb/C mice produce IgE as the allergic response. 

35 More details on assessing respiratory allergens in guinea pigs 
and mice is described by Kimber et al.,(1996). Fundamental and 
Applied Toxicology, 33, p. 1-10. 
Other animals such as rats, rabbits etc. may also be used for 
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comparable studies. 
Compos it ion 

The invention relates to a composition comprising a 
5 polypeptide-polymer conjugate of the invention. 

The composition may be a pharaaceutical or industrial 
composition. 

The composition may further comprise other polypeptides, 
proteins or enzymes and/or ingredients normally used in e.g. 

10 detergents, including soap bars, household articles, 
agrochemicals, personal care products, including skin care 
compositions, cleaning compositions for e.g. contact lenses, oral 
and dermal pharmaceuticals, composition use for treating textiles, 
compositions used for manuf actviring food, e.g. baking, and feed 

15 etc. 

Use of the polypeotide-polvmer conjugate 

The invention also relates to the use of the method of the 
invention for reducing the immxme response of polypeptides. 
20 It is also an object of the invention to use the polypeptide- 
polymer conjugate of the invention to reduce the allergenicity of 
industrial products, such as detergents, such as laundry, disk 
wash and hard surface cleaning detergents, and food or feed 
products . 

25 

MATERIAL AND METHODS 
Materials 

Enzymes : 

PD498: Protease of subtilisin type shown in WO 93/24623, The 
30 sequence of PD498 is shown in SEQ ID NO. 1 and 2. 
Savinase® (Available from Novo Nordisk A/S) 

Humicola lanuginosa lipase: Available from Novo Nordisk as 
lipolase® and is further described in EP 305,216. The DNA and 
protein sequence is shown in SEQ ID NO 5 and 6, respectively. 
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Strains; 

B. subtilis 309 and 147 are variants of Bacillus lentus , 
deposited with the NCIB and accorded the accession numbers NCIB 
5 10309 and 10147, and described in US Patent No. 3,723,250 
incorporated by reference herein. 

E. coli MC 1000 (M.J. Casadaban and S.N. Cohen (1980); J. 
Mol. Biol. 138 179-207), was made r^^m"*" by conventional methods 
and is also described in US Patent Application Serial No. 
10 039,298. 

Vectors; 

pPD498: E. coli - B. subtilis shuttle vector (described in 
US patent No. 5,621,089 under section 6.2.1.6) containing the 
15 wild-type gene encoding for PD498 protease (SEQ ID NO. 2) . The 
same vector is use for mutagenesis in E. coli as well as for 
expression in B. subtilis. 

General molecular biology methods; 

20 Unless otherwise mentioned the DNA manipulations and 

transformations were performed using standard methods of 
molecular biology (Sambrook et al. (1989) Molecular cloning; A 
laboratory manual. Cold Spring Harbor lab. , Cold Spring Harbor, 
NY; Ausubel, F. M. et al. (eds.) "Current protocols in 

25 Molecular Biology". John Wiley and Sons, 1995; Harwood, C. R., 
and Cutting, S. M. (eds.) "Molecular Biological Methods for 
Bacillus". John Wiley and Sons, 1990). 

Enzymes for DNA manipulations were used according to the 
specifications of the suppliers. 

30 

Materials, chemicals and solutions; 

Horse Radish Peroxidase labeled anti-rat-Ig (Dako, DK, P162, # 
031; dilution 1;1000). 
35 Mouse anti-rat IgE (Serotec MCA193; dilution 1;200). 
Rat anti-mouse IgE (Serotec MCA419; dilution 1:100). 
Biot in-labeled mouse anti-rat IgGl monoclonal antibody (Zymed 03- 
9140; dilution 1;1000) 
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Biotin-labeled rat anti-mouse IgGl monoclonal antibody (Serotec 
MCA336B; dilution 1:1000) 

Streptavidin-horse radish peroxidase (Kirkeg&rd & Perry 14-30-00; 
dilution 1:1000). 
5 CovaLink NH2 plates (Nunc, Cat# 459439) 
• Cyanuric chloride (Aldrich) 
Acetone (Merck) 

Rat anti-Mouse IgGl, biotin (SeroTec, Cat# MCA336B) 
Streptavidin, peroxidase (KPL) 
10 Ortho-Phenylene-diamine (OPD) (Kem-en-Tec) 
H2O2, 30% (Merck) 
Tween 20 (Merck) 
Skim Milk powder (Difco) 
H2SO4 (Merck) 

15 

Buffers and Solutions: 
" Carbonate buffer (0.1 M, pH 10 (1 liter)) Na2C03 10.60 g 

PBS (pH 7.2 (1 liter)) NaCl 8.00 g 

KCl 0.20 g 

20 K2HPO4 1.04 g 

KH2PO4 0.32 g 

Washing buffer PBS, 0.05% (v/v) Tween 20 
Blocking buffer PBS, 2% (wt/v) Skim Milk powder 

Dilution buffer PBS, 0.05% (v/v) Tween 20, 0.5% (wt/v) Skim Milk 
25 powder 

Citrate buffer (O.IM, pH 5.0-5.2 (1 liter) )NaCitrate 20.60 g 

Citric acid 6.30 g 

Activation of CovaLink plates: 

• Make a fresh stock solution of 10 mg cyanuric chloride per ml 
30 acetone. 

• Just before use, dilute the cyanuric chloride stock solution 
into PBS, while stirring, to a final concentration of Img/ml. 

• Add 100 ml of the dilution to each well of the CovaLink NH2 
plates, and incubate for 5 minutes at room temperature. 

35 • Wash 3 times with PBS. 

• Dry the freshly prepared activated plates at 50<»C for 30 
minutes. 

• Immediately seal each plate with sealing tape. 
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• Preactivated plates can be stored at room temperature for 3 
weeks when kept in a plastic bag. 

Sodium Borate, borax (Sigma) 
5 3,3-Dimethyl glutaric acid (Sigma) 
CaCl2 (Sigma) 

Tresyl chloride (2,2,2-triflouroethansulfonyl chloride) (Fluka) 
l-ethyl-3- (3-dimethylaminopropyl) carbodiimide (EDC) (Fluka) 
N-Hydroxy succinimide (Fluka art. 56480) ) 
10 Phosgene (Fluka art. 79380) 
Lactose (Merck 7656) 

PMSF (phenyl methyl sulfonyl f louride) from Sigma 
Succinyl-Alanine-Alanine-Proline-Phenylalanine-para-nitroanilide 
(Suc-AAPF-pNP) Sigma no. S-7388, Mw 624.6 g/mole. 

15 

Colouring substrate; 

OPD: o-phenylene-diamine, (Kementec cat no. 4260) 
Test Animals; 

20 Dunkin Hartley guinea pigs (from Charles River, DE) 

Female Balb/C mice (about 20 grams) purchased from Bomholdtgaard, 
Ry, Denmark. 

Equipment; 
25 XCEL II (Novex) 

ELISA reader (UVmax, Molecular Devices) 
HPLC (Waters) 
PFLC (Pharmacia) 

Superdex-75 column, Mono-Q, Mono S from Pharmacia, SW. 
3 0 SLT; Fotometer from SLT Lablnstruments 

Size-exclusion chromatograph (Spherogel TSK-G2000 SW) . 
Size-exclusion chromatograph (Superdex 200, Pharmacia, SW) 
Amicon Cell 

35 Enzymes for DNA manipulations 

Unless otherwise mentioned all enzymes for DNA 
manipulations, such as e.g. restriction endonucleases , ligases 
etc., are obtained from New England Biolabs. Inc. 
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Methods 

ELISA procedure for determination of IcrGj^ positive guinea pigs 

ELISA microtiter plates are coated with rabbit anti-PD498 
5 1:8000 in carbonate buffer and incubated over night at 4°C. The 
next day the plates is blocked with 2% BSA for 1 hour and washes 3 
times with PBS Tween 20. 

1 lig/ml PD498 is added to the plates and incubated for 1 hour, 
then washed 3 times with PBS Tween 20. 
10 All guinea pig sera samples and controls are applied to the 

ELISA plates with 2 [il sera and 98 ^1 PBS, incubated for 1 hour 
and washed 3 times with PBS Tween 20. 

Then goat anti-guinea pig IgGi (1:4000 in PBS buffer (Nordic 
Immunology 44-682)) is applied to the plates, incubated for 1 hour 
15 and washed with PBS tween 20. 

Alkaline phosphatase marked rabbit anti-goat 1:8000 (Sigma 
A4187) is applied and incubated for 1 hour, washed 2 times in PBS 
Tween20 and 1 time with diethanol amine buffer. 

The marked alkaline phosphatase is developed using p- 
20 nitrophenyl phosphate for 30 minutes at 31''C or until appropriate 
colour has developed. 

The reaction is stopped using Stop medium (K2HP04/HaH3 buffer 
comprising EDTA (pH 10)) and read at OD 405/650 using a ELISA 
reader . 

25 Double blinds are included on all ELISA plates. 

Positive and negative sera values are calculated as the 
average blind values added 2 times the standard deviation. This 
gives an accuracy of 95%. 

30 Determination of the molecule weight 

Electrophoretic separation of proteins was performed by standard 
methods using 4-20% gradient SDS poly acrylamide gels (Novex) . 
Proteins were detected by silver staining. The molecule weight was 
measured relative to the mobility of Mark-12® wide range molecule 

35 weight standards from Novex. 



Protease activity 
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Analysis with Suc-Ala-Ala-Pro-Phe-pNa ; 

Proteases cleave the bond between the peptide and p- 
nitroaniline to give a visible yellow colour absorbing at 405 nm* 

Buffer: e.g. Britton and Robinson buffer pH 8.3 
5 Substrate: 100 mg suc-AAPF-pNa is dissolved into 1 ml dimethyl 
sulfoxide (DMSO) • 100 |il of this is diluted into 10 ml with 
Britton cmd Robinson buffer. 

The substrate and protease solution is mixed and the 
absorbance is monitored at 405 nm as a function of time and ABS405 
10 nm/roin. The temperature should be controlled (20-50®C depending on 
protease) . This is a measure of the protease activity in the 
sample. 

Proteolvtic Activitv 

15 In the context of this invention proteolytic activity is 
expressed in Kilo NOVO Protease Units (KNPU) . The activity is 
determined relatively to an enzyme standard (SAVINASE^) , and 
the determination is based on the digestion of a dimethyl 
casein (DMC) solution by the proteolytic enzyme at standard 

20 conditions, i.e. 50*>C, pH 8.3, 9 min. reaction time, 3 min, 
measuring time. A folder AF 220/1 is available upon request to 
Novo Nordisk A/S, Denmark, which folder is hereby included by 
reference. 

A GU is a Glycine Unit, defined as the proteolytic enzyme 
25 activity which, under standard conditions, during a 15-minutes* 
incubation at 40^C, with N-acetyl casein as substrate, produces 
an amount of NH2 -group equivalent to 1 mmole of glycine. 

Enzyme activity can also be measured using the PNA assay, 
according to reaction with the soluble substrate succinyl- 
30 alanine-alanine-proline-phenyl-alanine-para-nitrophenol , which 
is described in the Journal of American Oil Chemists Society, 
Rothgeb, T.M. , Goodlander, B.D., Garrison, P.H., and Smith, 
L.A., (1988). 

35 Fermentation of PD498 variants 

Fermentation of PD498 variants in B. suhtilis are performed 
at 30^C on a rotary shaking table (300 r.p.m.) in 500 ml baffled 
Erlenmeyer flasks containing 100 ml BPX medium for 5 days. In 
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order to make an e.g. 2 liter broth 20 Erlenmeyer flasks are 
f erment ed s imultaneous ly • 

Media : 

5 BPX; Composition (per liter) 



Sodium caseinate lOg 

The starch in the medium is liquefied with a-amylase and 
the medium is sterilized by heating at 120''C for 45 minutes. 
After sterilization the pH of the medium is adjusted to 9 by 
15 addition of NaHCOa to 0.1 M. 

Purification of PD498 variants 

Approximately 1.6 litres of PD498 variant fermentation 
broth are centrifuged at 5000 rpm for 35 minutes in 1 litre 

20 beakers. The supernatants are adjusted to pH 7.0 using 10% 
acetic acid and filtered on Seitz Supra SlOO filter plates. 
The filtrates are concentrated to approximately 400 ml using an 
Amicon CH2A UF unit equipped with an Amicon SIYIO UF cartridge. 
The UF concentrate is centrifuged and filtered prior to 

25 absorption at room temperature on a Bacitracin affinity column 
at pH 7i The PD498 variant is eluted from the Bacitracin column 
at room temperature using 25% 2-propanol and 1 M sodium 
chloride in a buffer solution with 0.01 dime-thyl-glutaric 
acid, 0.1 M boric acid and 0.002 M calcium chloride adjusted to 

30 pH 7. 

The fractions with protease activity from the Bacitracin 
purification step are combined and applied to a 750 ml Sephadex 
G25 column (5 cm diameter) equilibrated with a buffer 
containing 0.01 dimethylglutaric acid, 0.1 M boric acid and 
35 0.002 M calcium chloride adjusted to pH 6.0. 

Fractions with proteolytic activity from the Sephadex G25 
column are combined and applied to a 150 ml CM Sepharose CL 6B 
cat-ion exchange column (5 cm diameter) equilibrated with a 



Potato starch 



lOOg 



10 



Ground barley 
Soybean flour 
Na2HP04 X 12 H2O 
Pluronic 



50g 
20g 
9g 

O.lg 
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buffer containing 0.01 M dime thy Iglutaric acid, 0.1 M boric 
acid, and 0.002 M calcium chloride adjusted to pH 6.0. 
The protease is eluted using a linear gradient of 0-0.5 M 
sodium chloride in 1 litres of the same buffer. 
5 Protease containing fractions from the CM Sepharose column are 
combined and filtered through a 2[i filter. 

Balb/C mice laG ELISA Procedure: 

• The antigen is diluted to 1 mg/ml in carbonate buffer. 
10 * 100 ml is added to each well. 

• The plates are coated overnight at 4®C. 

• Unspecific adsorption is blocked by incubating each well for 1 
hour at room temperature with 200 ml blocking buffer. 

• The plates are washed 3x with 300 ml washing buffer. 

15 • Unknown mouse sera are diluted in dilution buffer, typically 
lOx, 2 Ox and 4 Ox, or higher. 

• 100 ml is added to each well. 

• Incubation is for 1 hour at room temperature. 

- Unbound material is removed by washing 3x with washing buffer. 
20 • The anti-Mouse IgGl antibody is diluted 2000x in dilution 

buffer. 

- 100 ml is added to each well. 

• Incubation is for 1 hour at room temperature. 

• Unbound material is removed by washing 3x with washing buffer. 
25 • Streptavidine is diluted lOOOx in dilution buffer. 

• 100 ml is added to each well. 

• Incubation is for 1 hour at room temperature. 

• Unbound material is removed by washing 3x with 300 ml washing 
buffer. 

30 • OPD (0.6 mg/ml) and H2O2 (0.4 ml/ml) is dissolved in citrate 
buffer. 

• 100 ml is added to each well. 

• Incubation is for 10 minutes at room temperature. 

• The reaction is stopped by adding 100 ml H2SO4. 

35 • The plates are read at 492 nm with 620 nm as reference. 

Immunisation of mice 

Balb/C mice (20 grams) are immunised 10 times (intervals of 14 
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days) by subcutaneous injection of the modified or unmodified 
polypeptide in question, respectively by standard proceedures 
known in art. 



5 EXAMPLES 
Example 1 

suitable substitutions in PD498 for addi tion of amino 
10 attachment groups f-NHo ) 

The 3D structure of parent PD498 was modeled as described 
above based on 59% sequence identity with Thermitase® 
(2tec.pdb) . 

The sequence of PD498 is (see SEQ ID NO. 2) • PD498 residue 
15 numbering is used, 1-280. 

The commands performed in Insight (BIOSYM) are shown in the 
command files makeKzone.bcl and ma]ceKzone2 .bcl below: 



Conservative substitutions: 

20 makeKzone.bcl 

1 Delete Subset * 

2 Color Molecule Atoms * Specified Specification 55,0,255 

3 Zone Subset LYS :lys:NZ Static monomer /residue 10 
Color_Subset 255,255,0 

25 4 Zone Subset NTERM :1:N Static monomer/residue 10 
Color_Subset 255,255,0 

5 #NOTE: editnextline ACTSITE residues according to the 
protein 

6 Zone Subset ACTSITE : 39, 72, 226 Static monomer /residue 8 
30 Color_Subset 255,255,0 

7 Combine Subset ALLZONE Union LYS NTERM 

8 Combine Subset ALLZONE Union ALLZONE ACTSITE 

9 #NOTE: editnextline object name according to the protein 

10 Combine Subset REST Difference PD498FINALMODEL ALLZONE 
35 11 List Subset REST Atom Output_File restatom. list 

12 List Subset REST monomer /residue Output__File restmole. list 

13 Color Molecule Atoms ACTSITE Specified Specification 255,0,0 

14 List Subset ACTSITE Atom Output_File actsiteatora. list 

15 List Subset ACTSITE monomer /residue Output_File 
40 actsitemole. list 

16 # 

17 Zone Subset REST5A REST Static Monomer/Residue 5 - 
Color_Subset 

18 Coitibine Subset SUB5A Difference REST5A ACTSITE 
45 19 Combine Subset SUB5B Difference SUB5A REST 

20 Color Molecule Atoms SUB5B Specified Specification 
255 255 255 

21 List 'subset SUB5B Atom Output_Pile subSbatom. list 

22 List Subset SUB5B monomer /residue Output_File sub5bmole. list 
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23 #Now identify sites for lys->arg substitutions and continue 
with makezone2.bcl 

24 #Use grep command to identify ARG in restatom. list, 
subSbatom. list & accsiteatom. list 

5 

Comments : 

Lines 1-8: The subset ALLZONE is defined as those residues 
which are either within 10 A of the free amino groups on 
lysines or the N- terminal, or within 8 A of the catalytic triad 
10 residues 39, 72 and 226 • 

Line 10: The subset REST is defined as those residues not 
included in ALLZONE. 

Lines 17-20: Subset SUB5B is defined as those residues in a 
5 A shell around REST, excluding residues within 8 A of the 
15 catalytic residues. 

Line 23-24: REST contains Arg62 and Argl69, SUB5B contains 
ArgSl, Argl21, and Arg250. ACTSITE contains Argl03, but 
position 103 is within 8 A from essential_catalytic_residues, 
and thus not relevant. 
20 The colour codes are: (255,0,255) = magenta, 

(255,255,0)yellow, (255,0,0) red, and (255, 255, 255)= white. 

The substitutions R51K, R62K, R121K, R169K and R250K are 
identified in parent PD498 as suitable sites for mutagenesis. 
The residues are substituted below in section 2, and further 
25 analysis done: 



Non-conservative substitutions: 
]aakeK2one2 .bcl 

I #sourcefile makezone2.bcl Claus von der Osten 961128 
30 2 # 

3 #having scanned lists (grep arg command) and identified 
sites for lys->arg substitutions 

4 #NOTE: editnextline object name according to protein 

5 Copy Object -To_Clipboard -Displace PD498FINALMODEL 
35 newmodel 

6 Biopolyraer 

7 #NOTE: editnextline object name according to protein 

8 Blank Object On PD498FINALMODEL 

9 #NOTE: editnextlines with lys->arg positions 
40 10 Replace Residue newmodel: 51 lys L 

II Replace Residue newmodel: 62 lys L 

12 Replace Residue newmodel: 121 lys L 

13 Replace Residue newmodel: 169 lys L 

14 Replace Residue newmodel: 250 lys L 
45 15 # 
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16 #Now repeat analysis done prior to arg->lys, now including 
introduced lysines 

17 Color Molecule Atoms newmodel Specified Specification 
255,0,255 

5 18 Zone Subset LYSx newmodel: lys:NZ Static monomer/ residue 10 
Color_Subset 255,255,0 

19 Zone Subset NTERMx newmodel: 1:N Static monomer /residue 10 
Color_Subset 255,255,0 

20 #NOTE: editnextline ACTSITEx residues according to the 
10 protein 

21 Zone Subset ACTSITEx newmodel: 3 9, 7 2, 226 Static 
monomer /residue 8 Color__Subset 255,255,0 

22 Combine Subset ALLZONEx Union LYSx NTERMx 

23 Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
15 24 Combine Subset RESTx Difference newmodel ALLZONEx 

25 List Subset RESTx Atom Output^File restxatom. list 

26 List Subset RESTx monomer/ residue Output_File 
restxmole. list 

27 # 

20 28 Color Molecule Atoms ACTSITEx Specified Specification 
255,0,0 

29 List Subset ACTSITEx Atom Output File act sitexatom. list 

30 List Subset ACTSITEx monomer/ residue Output_File 
actsitexmole. list 

25 31 # 

32 #read restxatom • list or restxmole. list to identify sites 
for (not_arg) ->lys subst. if needed 

Comments : 

30 Lines 1-15: Solvent exposed arginines in subsets REST and 

SUB5B are replaced by lysines. Solvent accessibilities are 

recalculated following arginine replacement. 

Lines 16-23: The subset ALLZONEx is defined as those 

residues which are either within 10 A of the free amino groups 
35 on Lysines (after replacement) or the N~terminal, or within 8 A 

of the catalytic triad residues 39, 72 and 226. 

Line 24-26: The subset RESTx is defined as those residues 

not included in ALLZONEx, i.e. residues which are still 

potential epitope contributors. Of the residues in RESTx, the 
40 following are >5% exposed (see lists below): 6-7,9-12,43- 

45,65,87-88,209,211,216-221,262. 

The following mutations are proposed in parent PD498: P6K, 

Y7K, S9K, AlOK, YllK, Q12K, D43K, Y44K, N45K, N65K, G87K, I88K, 

N209K, A211K, N216K, N217K, G218K, Y219K, S220K, Y221K, G262K. 
45 Relevant data for Example 1: 

Solvent accessibility data for PD498MODEL: 

# PD498MODEL Fri Nov 29 10:24:48 MET 1996 

# residue area 
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TRP 


1 


136.275711 




SER 


'2 


88.188095 




PRO 


3 


15*458788 




asn" 


'4 


95.322319 


5 


ASP 5 


4.903404 




PRO 


6 


68.096909 




TYR 


1 


93.333252 




TYR_ 


"8 


31.791576 




SER 


'9 


95.983139 


10 


AIiA" 


10 


77.983536 




TYR*" 


11 


150.704727 




GLN^ 


'12 


26.983349 




tyr" 


"13 


44.328232 




gly" 


*14 


3.200084 


15 


PRO 


"15 


2. 149547 




GLN 


"16 


61.385445 




ASN^ 


"17 


37.776707 




thr" 


"18 


1.237873 




SER" 


"19 


41.031750 


20 


thr" 


"20 


4.321402 




pro" 


"21 


16.658991 




ALA"" 


"22 


42. 107288 




ALA" 


"23 


0.000000 




TRP" 


"24 


3.713619 


25 


asp" 


"25 


82.645493 




val" 


'26 


74.397812 




thr" 


"27 


14.950654 




arg" 


"28 


110.606209 




gly" 


'29 


0.242063 


30 


ser" 


'30 


57.225292 




ser" 


'31 


86.986198 




thr" 


'32 


1.928865 




gln" 


"3 3 


42.008949 




thr" 


'34 


0.502189 


35 


val"" 


'35 


0.268693 




ALA" 


"36 


0.000000 




val" 


"37 


5.255383 




leu 


"38 


1.550332 




asp" 


"39 


3.585718 


40 


ser" 


"40 


2.475746 




gly" 


"41 


4.329043 




val' 


"42 


1.704864 




asp" 


"43 


25.889742 




tyr" 


"44 


89.194855 


45 


asn" 


"45 


109.981819 




his" 


"46 


0.268693 




pro" 


"47 


66.580925 




asp" 


"48 


0.000000 




leu' 


"49 


0.770882 


50 


ALA" 


"50 


49 . 618046 




arg" 


'51 


218.751709 




LYS" 


"52 


18.808538 




val" 


"53 


39.937984 




ile' 


"54 


98.478104 


55 


LYS' 


"55 


103.612228 




GLY 56 


17.199390 




TYR_57 


67.719147 
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ASP 


58 


0.000000 




phe"" 


59 


40*291119 




ILE 


"60 


50-151962 




ASP 


61 


70.078888 


5 


ARC 


'62 


166.777557 




asp"" 


63 


35.892376 




ASN 


"64 


120.641953 




asn" 


65 


64.982895 




pro"" 


66 


6.986028 


10 


met" 


67 


58.504269 




ASP" 


68 


28.668840 




leu"" 


"69 


104.467468 




ASN 


70 


78.460953 




GLY 


71 


5.615932 


15 


his" 


"72 


43.158905 




gly" 


73 


0.268693 




thr" 


"74 


0.000000 




HIS 


"75 


0.484127 




val" 


76 


1.880854 


20 


ALA" 


"77 


0.000000 




gly" 


"78 


0.933982 




thr" 


"79 


9.589676 




val" 


"80 


0.000000 




ALA" 


"81 


0.000000 


25 


ALA" 


"82 


0.000000 




asp" 


"83 


46.244987 




thr" 


"84 


27.783333 




asn" 


"85 


75.924225 




asn" 


"86 


44.813908 


30 


gly" 


"87 


50.453152 




ILE" 


"88 


74.428070 




gly" 


"89 


4.115077 




val" 


"90 


6.717335 




ALA" 


"91 


2.872341 


35 


gly]; 


"92 


0.233495 




met" 


"93 


5.876057 




ALA' 


"94 


0.000000 




pro" 


"95 


17.682203 




asp" 


"96 


83.431740 


40 


thr" 


"97 


1.506567 




LYS" 


"98 


72.674973 




ILE* 


"99 


4.251006 




leu" 


"100 


6.717335 




ALA" 


"101 


0.806080 


45 


val" 


"l02 


1.426676 




arg" 


"103 


2.662697 




val" 


"104 


2.171855 




leu" 


"105 


18.808538 




asp" 


"106 


52.167435 


50 


ALA" 


"107 


52.905663 




asn" 


"108 


115.871315 




gly" 


109 


30.943356 




ser" 


110 


57.933651 




gly" 


"ill 


50.705326 


55 


ser" 


"112 


56.383320 




LEU 


"ll3 


71.312195 




asp" 


"114 


110.410919 
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SER 115 


13.910152 




ILE 116 


22.570246 




ALA 


117 


5.642561 




SER 


118 


29.313131 


5 


GLY 


119 


0.000000 




ILE 


120 


1.343467 




ARG 


121 


118.391129 




TYR 


122 


44.203033 




ALA 


123 


0.000000 


10 


ALA" 


124 


7.974043 




ASP 


125 


83.851639 




GLN 


126 


64.311974 




gly"" 


127 


36.812618 




ALA" 


128 


4.705107 


15 


LYS 


129 


90.886139 




VAL 


130 


1.039576 




LEU" 


131 


2.149547 




asn" 


132 


4.315227 




leu" 


'133 


1.880854 


20 


SER 


"134 


3.563334 




leu" 


135 


26.371397 




gly" 


136 


59.151070 




CYS" 


"137 


63.333755 




GLU" 


'138 


111.553314 


25 


CYS"^ 


'l39 


83.591461 




asn" 


"140 


80.757843 




seeC 


"141 


25.899158 




thr" 


"142 


99.889725 




thr" 


"143 


73.323814 


30 


leu" 


"144 


5.589301 




LYS"" 


"145 


94.708755 




ser" 


"146 


72.636993 




ALA 


"147 


9.235920 




VAL" 


'148 


1.612160 


35 


ASP 149 


57.431465 




TYR 150 


106.352493 




ALA 


151 


0.268693 




TRP" 


"152 


43.133667 




asn" 


"153 


112.864975 


40 


LYS" 


"154 


110.009468 




gly" 


"155 


33.352180 




ALA" 


"156 


3.493014 




VAL" 


"157 


1.048144 




val" 


"158 


2.043953 


45 


VAL" 


"159 


0.000000 




ALA' 


160 


0.537387 




ALA' 


"161 


10.872165 




ALA* 


"162 


7.823834 




gly" 


"163 


12.064573 


50 


asn" 


"164 


81. 183388 




asp" 


"165 


64.495300 




ASN 166 


83.457443 




VAL 


167 


68.516815 




ser" 


"168 


78.799652 


55 


arg" 


"169 


116.937134 




thr" 


"170 


57.275074 




phe' 


"171 


51.416462 
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GLN 


172 


18 .934589 




PRO 


173 


1.880854 




ALA_ 


174 


6. 522357 




SER 


175 


26.184139 


5 


tyr"" 


176 


21.425076 




PRO 


177 


85.613541 




ASN 178 


34.700817 




ALA 


179 


0.268693 




ile" 


180 


1.074774 


10 


ALA 181 


3.761708 




VAL 


182 


0. 000000 




GLY 


183 


2. 149547 




ALA 


184 


0.951118 




ILE 


185 


0. 806080 


15 


ASP 


186 


30.022263 




ser"" 


"187 


72 . 518509 




ASN 


'188 


117 . 128021 




ASP 


"189 


47.601345 




arg" 


"190 


150. 050873 


20 


LYS 


"191 


64.822807 




ALA*" 


"192 


2.686934 




SER" 


"193 


96.223808 




PHE 


"194 


51.482613 




ser" 


"195 


1. 400973 


25 


ASN 


"196 


4 . 148808 




tyr" 


"197 


80. 937309 




gly" 


"198 


10.747736 




thr" 


"199 


93.221252 




TRP"" 


"200 


169 .943604 


30 


val" 


"201 


15.280325 




asp" 


"202 


12 . 141763 




val" 


"203 


0.268693 




thr" 


"204 


3.409728 




ALA" 


'205 


0. 000000 


35 


pro" 


"206 


0. 000000 




gly" 


"207 


0. 000000 




val' 


"208 


37. 137192 




asn" 


"209 


78 . 286270 




ile" 


"210 


9.404268 


40 


ALA] 


"211 


25. 938599 




ser" 


"212 


5. 037172 




thr" 


"213 


0-000000 




val" 


"214 


22 . 301552 




pro' 


"215 


45. 251030 


45 


ASN 216 


131.014160 




ASN 217 


88. 383461 




GLY 


218 


21. 226780 




tyr] 


"219 


88.907570 




ser" 


"220 


39.966541 


50 


tyr" 


"221 


166. 037018 




MET 


""222 


50.951096 




ser" 


223 


54.435001 




gly" 


""224 


1.880854 




thr" 


225 


1.634468 


55 


ser" 


"226 


17.432346 




met" 


227 


7.233279 




ALA" 


""228 


0.000000 
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SER_229 
PRO_230 
HIS_231 
VAL_232 
5 ALA_233 
GLY_234 
LEU_235 
ALA_236 
ALA_237 

10 LEU_238 
LEU_239 
ALA_240 
SER_241 
GLN_242 

15 GLy_243 
LyS_244 
ASN_245 
ASN_246 
VAL_247 

20 GLN_248 
ILE_249 
ARG_250 
GLN_251 
ALA_252 

25 ILE_253 
GLU_254 
GLN_255 
THR_256 
ALA_257 

30 ASP_258 
LyS_259 
ILE_260 
SER_261 
GLY_262 

35 THR_263 
GLY_264 
THR_265 
ASN_266 
PHE_267 

40 LYS_268 
TyR_269 
GLY_270 
LYS_271 
ILE_272 

45 ASN_273 
SER_274 
ASN_275 
LYS_276 
ALA_277 

50 VAL_278 
ARG_279 
TYR_280 
CA_281 
CA_282 

55 CA 283 



0.000000 
0.268693 
2.680759 
0.000000 
0.000000 

I. 074774 

II. 500556 
0.000000 
0.000000 

I. 612160 
0.000000 
10.648088 
39.138004 
71.056175 
66.487144 
43.256012 
80.728127 
34.859673 
84.145645 
51.819775 
8.598188 
35.055809 
71.928093 
0.000000 
4.845899 
13.344438 
81.705254 
9.836061 
2.810513 
44.656136 
113.071686 
32.089527 
91.590103 
26.450439 
38.308762 
46.870056 
88.551804 
34.698349 
7.756911 
103.212852 
37.638382 
0.000000 

II. 376978 
2.885231 
19.195255 
2.651736 
38.177547 
84.549576 
1.074774 
4.775503 
162.693054 
96.572929 
0.000000 
0.000000 
8.803203 



Subset REST: 
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restmole. list 
Subset REST: 

PD498FINALiyiODEL: 6-7 , 9-12 , 43-46 , 61-63 , 65 , 87- 
89 , 111-114 , 117-118 , 131 , 
5 PD498FINALMODEL: 137-139 , 158-159 , 169-171 , 173- 
174,180-181,209,211, 

PD498FINALMODEL: 216-221 , 232-233 , 262 , E282H 

restatom. list 
Subset REST: 
10 PD498FINALMODEL:PRO 6 :N, CA, CD, C, O, CB, CG 

PD4 9 SFINAIiMODEL : TYR 7 : N , CA , C , O , CB , CG , CDl , CD2 , CEl , CE2 , C2 , OH 
PD498FINALM0DEL: SER 9 : N , CA, C , O , CB , OG 
PD498FINALMODEL:ALA 10:N,CA,C,O,CB 

PD4 9 8FINALM0DEL : TYR 11 : N , CA , C , O , CB , CG , CDl , CD2 , CEl , CE2 , CZ , OH 
15 PD498FINALMODEL:GLN 12 :N,CA,C,0,CB, CG, CD,0E1,NE2 

PD4 9 8 FINALMODEL : ASP 4 3 : N , CA , C , O , CB , CG , ODl , 0D2 
PD4 9 8FINALM0DEL : TYR 

44:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD4 9 8 FINALMODEL : ASN 4 5 : N , CA , C , O , CB , CG , OD 1 , ND2 
2 0 PD4 9 8 FINALMODEL : HI S 

46:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 
PD4 9 8 FINALMODEL : ASP 6 1 : N , CA , C , O , CB , CG , ODl , 0D2 
PD4 9 8 FINALMODEL : ARG 
62:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
25 PD498FINALMODEL:ASP 63 :N,CA, C,0,CB, CG,0D1,0D2 

PD4 9 8 FINALMODEL : ASN 6 5 : N , CA , C , O , CB , CG , OD 1 , ND2 
PD4 9 8 FINALMODEL :GLY 87:N,CA,C,0 

PD4 9 8 FINALMODEL : ILE 88 : N , CA , C , O , CB , CGI , CG2 , CDl 

PD4 9 8 FINALMODEL :GLY 89:N,CA,C,0 
30 PD498FIN7VLMODEL:GLY 111:N,CA,C,0 

PD4 9 8 FINALMODEL: SER 112 :N, CA, C, O, CB,OG 

PD4 9 8 FINALMODEL : LEU 113 : N , CA , C , O , CB , CG , CDl , CD2 

PD4 9 8 FINALMODEL: ASP 114 :N, CA, C,0, CB, CG, 0D1,0D2 

PD498FINALM0DEL:ALA 117 : N, CA, C, O, CB 
35 PD498FINALMODEL:SER 118 :N,CA, C, O, CB, OG 

PD4 9 8 FINALMODEL : LEU 13 1 : N , CA , C , O , CB , CG , CDl , CD2 

PD498FINALMODEL:CYS 137 :N,CA,C,0,CB, SG 

PD4 9 8 FINALMODEL : GLU 
138:N,CA,C,0,CB,CG,CD,OE1,OE2 
40 PD4 9 8 FINALMODEL :CYS 139:N,CA,C,0,CB,SG 

PD4 9 8 FINALMODEL : VAL 15 8 : N , CA , C , O , CB , CGI , CG2 

PD4 9 8 FINALMODEL : VAL 159 : N , CA , C ^ O , CB , CGI , CG2 

PD4 9 8 FINALMODEL : ARG 
169:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
4 5 PD4 9 8 FINALMODEL : THR 17 0 : N , CA , C , O , CB , OGl , CG2 

PD4 9 8 FINALMODEL : PHE 
171:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

PD4 9 8 FINALMODEL : PRO 17 3 : N , CA , CD , C , O , CB , CG 

PD4 9 8 FINALMODEL: ALA 174 :N, CA, C, O, CB 
50 PD4 9 8 FINALMODEL : ILE 18 0 : N , CA , C , 0 , CB , CGI , CG2 , CDl 

PD4 9 8 FINALMODEL: ALA 181:N,CA,C,0,CB 

PD4 9 8 FINALMODEL: ASN 209:N,CA,C,0,CB,CG,OD1,ND2 

PD4 9 8 FINALMODEL: ALA 211 :N, CA, C, O, CB 

PD4 9 8 FINALMODEL: ASN 2 16 : N , CA , C , O , CB , CG , ODl , ND2 
55 PD4 9 8 FINALMODEL: ASN 217 :N, CA, C, O, CB, CG, ODl, ND2 

PD4 9 8 FINALMODEL :GLY 218:N,CA,C,0 



- WO98/35026 



50 



PCTA>K98/00046 



PD4 SSFINALjMODEL : TYR 

219:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD498FINALMODEL:SER 220:N, CA, C,0, CB,OG 
PD4 9 8 FINALMODEL : TYR 
5 221:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

PD498FINAIiMODEL: VAL 232 : CA, C , O , CB, CGI , CG2 
PD4 9 8 FINALMODEL: ALA 233 : N, CA, C, O, CB 
PD4 9 8 FINALMODEL :GLY 262:N,CA,C,0 
PD498FINALM0DEL:CA E282H:CA 

10 

Sxibset SUB5B: 

subSbmole* list 
Subset SUB5B: 

PD498FINALMODEL: 4-5, 8, 13-16, 34-35, 47- 
15 51,53,64,83,85-86,90-91,120-124, 

PD4 9 8 FINALMODEL: 128-130, 140-141, 143-144 , 147- 
148, 151-152 , 156-157, 

PD498FINALMODEL: 165 , 167-168 , 172 , 175-176 , 178- 
179,196,200-205,208, 
20 PD498FINALMODEL: 2 34-237, 250, 253-254, 260-261, 263- 

267,272,E281H, 

PD498FINALMODEL: E283H 

subSbatom. list 

25 Subset SUB5B: 

PD4 9 8 FINALMODEL : ASN 4 : N , CA , C , O , CB , CG , OD 1 , ND2 
PD498FINALMODEL: ASP 5 : N, CA, C , O , CB , CG , ODl , 0D2 
PD4 9 8 FINALMODEL : TYR 
8:N,CA,C,0,CB,CG,CD1,CD2^CE1,CE2,CZ,0H 

30 PD498FINALMODEL: TYR 

13:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD4 9 8 FINALMODEL :GLY 14:N,CA,C,0 
PD4 9 8FINALM0DEL : PRO 15 : N , CA , CD , C , O , CB , CG 
PD4 9 8 FINALMODEL : GLN 1 6 : N , CA , C , O , CB , CG , CD , OE 1 , NE2 

35 PD4 9 8 FINALMODEL :THR 34 :N, CA, C, O, CB,0G1, CG2 

PD4 9 8FINALMODEL : VAL 3 5 : N , CA , C , O , CB , CGI , CG2 
PD4 9 8 FINALMODEL: PRO 47 :N, CA, CD, C, O, CB, CG 
PD4 9 8 FINALMODEL: ASP 48 :N, CA, C, O, CB, CG, ODl ,0D2 
PD4 9 8FINALM0DEL : LEU 4 9 : N , CA , C , O , CB , CG , CDl , CD2 

40 PD498FINAIiMODEL:ALA 50 :N, CA, C,0, CB 

PD4 9 8FINALM0DEL : ARG 

51:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
PD4 9 8 FINALMODEL : VAL 5 3 : N , CA , C , O , CB , CG 1 , CG2 
PD4 98FINALMODEL : ASN 64 : N , CA , C , O , CB , CG , ODl , ND2 

45 PD498FINALMODEL:ASP 83 :N, CA, C, O, CB, CG, ODl , 0D2 

PD4 9 8 FINALMODEL : ASN 8 5 : N , CA , C , O , CB , CG , ODl , ND2 
PD498FINALMODEL:ASN 86 :N, CA, C, O, CB, CG,0D1,ND2 
PD498FINALM0DEL: VAL 90 : N , CA, C , O , CB , CGI , CG2 
PD498FINALMODEL:ALA 91:N,CA,C,0,CB 

5 0 PD4 9 8FINALM0DEL : ILE 12 0 : N , CA , C , O , CB > CGI , CG2 , CDl 

PD4 9 8 FINALMODEL : ARG 

121:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
PD4 9 8FINALM0DEL : TYR 
122 :N, CA, C, O, CB , CG, CDl , CD2 , CEl , CE2 , CZ , OH 

55 PD498FINALMODEL:ALA 123 : N, CA, C,0 , CB 

PD498FINALMODEL : ALA 124 : N , CA , C , O , CB 
PD498FINALMODEL:ALA 128:N,CA,C,0,CB 
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PD498FINALMODEL:LYS 129 :N, CA, C,0, CB, CG, CD, CE,NZ 
PD4 9 8 FINALMODEL : VAL 1 3 0 : N , CA , C , O , CB , CGI , CG2 
PD4 9 BFINALMODEL : ASN 14 0 : N , CA , C , O , CB , CG , ODl , ND2 
PD4 9 BFINALMODEL : SER 14 1 : N , CA , C , O , CB , OG 
5 PD498FINALMODEL:THR 143 :N,CA,C,0, CB,0G1,CG2 

PD4 9 BFINALMODEL : LEU 144 : N , CA , C , O , CB , CG , CDl , CD2 
PD498FINALMODEL:ALA 147 :N, CA, C, O, CB 
PD4 9 BFINALMODEL: VAL 148 :N, CA, C, O, CB, CGI, CG2 
PD4 9 8 FINALMODEL: ALA 151 :N, CA, C, O, CB 

10 PD498FINALM0DEL: TRP 

52:N,CA,C,0,CB,CG^CD1,CD2,NE1,CE2,CE3, 
CZ2,CZ3,CH2 
PD498FINALMODEL:ALA 156 :N, CA, C,0, CB 
PD498FINALMODEL:VAL 157:N,CA,C,0,CB,CG1,CG2 

15 PD498FINALMODEL:ASP 165:N,CA,C,0,CB,CG,ODl,OD2 

PD4 9 8 FINALMODEL : VAL 1 67 : N , CA , C , O , CB , CGI , CG2 
PD498FINALMODEL:SER 168 : N, CA, C, O, CB,OG 
PD498FINALMODEL: GLN 

172:N,CA,C,0,CB,CG,CD,0E1,NE2 

20 PD4 9 8 FINALMODEL: SER 175 :N, CA, C, O, CB,OG 

PD4 9 BFINALMODEL : TYR 

17 6 : N , CA , C , O , CB , CG , CDl , CD2 , CEl , CE2 /CZ , OH 
PD4 9 8 FINALMODEL : ASN 1 7 8 : N , CA , C , O , CB , CG , ODl , ND2 
PD498FINALM0DEL:ALA 179 :N, CA, C,0, CB 

25 PD4 9 8 FINALMODEL: ASN 196:N,CA, C,0,CB,CG,0D1,ND2 

PD4 9 8 FINALMODEL : TRP 

200:N,CA,C,O,CB,CG,CDl,CD2,NEl,CE2,CE3, 
CZ2,CZ3,CH2 
PD498FINALMODEL:VAL 201:N,CA,C,0,CB,CG1,CG2 

30 PD49BFINALMODEL:ASP 202 :N, CA, C, 0, CB, CG, ODl , 0D2 

PD4 9 8 FINALMODEL : VAL 2 0 3 : N , CA , C , O , CB , CGI , CG2 
PD4 9 BFINALMODEL :THR 204 :N, CA, C,0, CB,0G1,CG2 
PD49BFINAIiMODEL:ALA 205 :N, CA, C,0, CB 
PD4 9 BFINALMODEL : VAL 2 08 : N , CA , C , O , CB , CGI , CG2 

35 PD4 9 8 FINALMODEL :GLY 234:N,CA,C,0 

PD4 9 8 FINALMODEL : LEU 2 35 : N , CA , C , O , CB , CG, CDl , CD2 
PD498FINALMODEL:ALA 236 :N, CA/C, O, CB 
PD49BFINALM0DEL:ALA 237 :N, CA, C, O, CB 
PD4 9 8 FINALMODEL : ARG 

40 250:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 

PD4 9 BFINALMODEL : ILE 2 53 : N , CA , C , O , CB , CGI , CG2 , CDl 
PD498FINALMODEL: GLU 

254:N,CA,C,0,CB,CG,CD,OEl,OE2 
PD498FINALMODEL:ILE 260 :N, CA, C, 0, CB, CGI , CG2 , CDl 

45 PD49BFINALMODEL:SER 261 : N, CA, C, O, CB, OG 

PD49BFINALM0DEL: THR 2 63 : N , CA, C , O , CB , OGl , CG2 
PD4 9 8 FINALMODEL :GLY 264:N,CA,C,0 
PD4 9 8 FINALMODEL: THR 265 :N, CA, C, O, CB, OGl , CG2 
PD4 9 BFINALMODEL : ASN 2 66 : N , CA , C , O , CB , CG , ODl , ND2 

50 PD4 9 BFINALMODEL :PHE 

267:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
PD4 9 8 FINALMODEL : ILE 2 7 2 : N , CA , C , O , CB , CGI , CG2 , CD 1 
PD4 9 BFINALMODEL :CA E281H:CA 
PD498FINALM0D£L:CA E283H:NA 

55 

Subset ACTSITE: 

actsitexaole. list 
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PD498FINALMODEL: 36-42 , 57-60,66-80 , 100-110 , 115- 
116, 119, 132-136, 160-164 , 

PD498FINALMODEL: 182-184 , 194 ,206-207 , 210 ,212- 
5 215,222-231 

actsiteatom. list 
Subset ACTSITE: 

PD498FINALM0DEL:AIiA 36 :N, CA, C,0, CB 

10 PD498FINAIiMODEL:VAL 37:N,CA,C,0,CB,CG1,CG2 

PD4 9 8 FINALMODEL : LEU 3 8 : N , CA , C , O , CB , CG , GDI , CD2 
PD4 9 8 FINALMODEL : ASP 3 9 : N , CA , C , O , CB , CG , ODl , OD2 
PD498FINALMODEL:SER 40:N, CA, C,0,CB,OG 
PD498FINALMODEL:GLY 41:N,CA,C,0 

15 PD498FINALMODEL:VAL 42:N,CA,C,0,CB,CG1,CG2 

PD498FINALMODEL: TYR 

57:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD498FINALMODEL:ASP 58:N,CA,C,0,CB,CG,ODl,OD2 
PD498FINALMODEL: PHE 

20 59 : N, CA, C, O , CB /CG , CDl , CD2 , CEl , CE2 , CZ 

PD4 9 8 FINALMODEL : ILE 60 : N , CA, C , O , CB , CGI , CG2 , CDl 
PD498FINALMODEL: PRO 66 : N, CA, CD , C, O, CB, CG 
PD498FINALM0DEL:MET 67 :N,CA, C, 0,CB, CG, SD, CE 
PD4 9 8 FINALMODEL : ASP 6 8 : N , CA , C , O , CB , CG , ODl , 0D2 

25 PD498FINALMODEL:LEU 69 :N, CA, C,0,CB, CG, CDl, CD2 

PD4 9 8 FINALMODEL :ASN 70:N, CA, C,0,CB,CG,0D1,ND2 
PD4 9 8 FINALMODEL :GLY 71:N,CA,C,0 
PD498FINALMODEL: HIS 

72:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 

30 PD4 9 8 FINALMODEL :GLY 73:N,CA,C,0 

PD 4 9 8 FINALMODEL : THR 7 4 : N , CA , C , O , CB , OG 1 , CG2 
PD4 9 8 FINALMODEL : HI S 

^ 75:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 
PD4 9 8 FINALMODEL : VAL 7 6 : N , CA , C , O , CB , CGI , CG2 

35 PD498FINALMODEL:ALA 77 :N, CA,C,0,CB 

PD498FINAIiM0DEL:GLY 78:N,CA,C,0 
PD4 9 8 FINALMODEL: THR 79 : N , CA , C , O , CB , OGl , CG2 
PD4 9 8 FINALMODEL : VAL 8 0 : N , CA, C , O , CB , CGI , CG2 
PD498FINALMODEL: LEU 100 : N , CA, C , O , CB , CG , CDl , CD2 

40 PD498FINALMODEL:ALA 101:N,CA,C,0,CB 

PD4 9 8 FINALMODEL : VAL 102 : N , CA , C, O , CB , CGI , CG2 
PD4 9 8 FINALMODEL : ARG 

103:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
PD4 9 8 FINALMODEL : VAL 104 : N , CA , C , O , CB , CGI , CG2 

4 5 PD4 9 8FINALM0DEL: LEU 105 : N , CA , C , O , CB , CG , CDl , CD2 

PD4 9 8FINALM0DEL : ASP 10 6 : N , CA , C , O , CB , CG , ODl , 0D2 
PD498FINALMODEL:ALA 107:N,CA,C,0,CB 
PD4 9 8FINALM0DEL : ASN 108 : N , CA , C , O , CB , CG , ODl , ND2 
PD498FINALMODEL:GLY 109:N,CA,C,0 

50 PD498FINALMODEL:SER 110:N,CA, C,0,CB,OG 

PD4 9 8FINALM0DEL : SER 1 15 : N , CA , C , O , CB , OG 
PD498FINALMODEL:ILE 116 : N, CA, C, O, CB, CGI , CG2 , CDl 
PD4 9 8 FINALMODEL :GLY 119:N,CA,C,0 
PD4 9 8 FINALMODEL : ASN 1 3 2 : N , CA , C , O , CB , CG , ODl , ND2 

55 PD498FINALMODEL:LEU 133 :N, CA, C, O, CB, CG, CD1,CD2 

PD498FINALMODEL: SER 134 :N, CA, C, O, CB, OG 
PD4 9 8FINALM0DEL : LEU 13 5 : N , CA , C , O , CB , CG , CDl , CD2 
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PD498FINALMODEL:GLY 136:N,CA,C,0 

PD498FINAIiM0DEL:ALA 160:N,CA,C,0,CB 

PD498FINAmODEL:ALA 161 :N, CA, C, 0, CB 

PD498FINALMODEL:ALA 162 :N,CA,C,0,CB 

5 PD498FINAI*M0DEL:GLY 163:N,CA,C,0 

PD498FINALM0DEL : ASN 164 : N , CA , C , O , CB , CG , ODl , ND2 

PD498FINALiMOPEL:VAL 182 :N,CA,C,0, CB, CGI, CG2 

PD498FINALMODEL:GLY 183:N,CA,C,0 

PD498FINALM0DEL:ALA 184 :N,CA,C,0,CB 
10 PD498FINALMODEL : PHE 

194 :N, CA, C, O , CB, CG, GDI , CD2 , CEl , CE2 , CZ 

PD498FINALMODEL:PRO 206 :N, CA, CD, C, CB, CG 

PD498FINALM0DEL:GLY 207:N,CA,C,O 

PD4 98FINALMODEL : ILE 2 10 : N , CA , C , O , CB , CGI , CG2 , GDI 

15 PD498FINALMODEL:SER 212 :N,CA,C,0,CB,OG 

PD498FINALMODEL:THR 213 :N, CA, C,0, CB, OGl, CG2 

PD4 9 8FINALM0DEL : VAL 2 14 : N , CA , C , O , CB , CG 1 , CG2 

PD4 9 8 FINALMODEL : PRO 215: N , CA , CD , C , O , CB , CG 

PD498FINALMODEL:MET 222 :N, CA, C,0, CB, CG, SD, CE 

20 PD498FINALMODEL:SER 223 :N,CA,C,0, CB,OG 

PD4 9 8 FINALMODEL :GLY 224:N,CA,C,0 

PD4 9 8FINALM0DEL : THR 2 2 5 : N , CA , C , O , CB , OGl , CG2 

PD498FINALMODEL:SER 226 :N,CA, C,0,CB,OG 

PD498FINALMODEL:MET 227 :N,CA,C,0,CB,CG,SD,CE 

25 PD498FINALMODEL:ALA 228:N,CA,C,0,CB 

PD498FINALMODEL:SER 229 :N, CA, C, O, CB , OG 

PD4 9 8 FINALMODEL : PRO 2 3 0 : N , CA, CD , C , O , CB , CG 
PD498FINALM0DEL : HIS 
231:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 

30 

Subset RESTx: 

restxinole. list 
Subset RESTX: 

NEWMODEL: 6-7 , 9-12 , 43-46 , 65 , 87- 
35 89,131,173,209,211,216-221,232-233, 
NEWMODEL: 262 , E282H 

restxatom. list 
Subset RESTX : 
40 NEWMODEL:PRO 6 :N, CA, CD, C, O, CB, CG 

NEWMODEL :TYR 
7 : N , CA , C , O , CB , CG , CDl , CD2 , CEl , CE2 , C Z , OH 
NEWMODEL:SER 9:N,CA,C,0,CB,0G 
NEWMODELxALA 10:N,CA,C,0,CB 
45 NEWMODEL :TYR 

11:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

NEWM0DEL:GLN 12 :N,CA,C,0,CB,CG,CD,0E1,NE2 
NEWMODEL:ASP 43 :N, CA, C,0, CB, CG, OD1,OD2 
NEWMODEL :TYR 
50 44:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

NEWM0DEL:ASN 45:N, CA,C,0, CB,CG,0D1,ND2 
NEWMODEL : HIS 4 6 : N , CA , C , O , CB , CG , NDl , CD2 , CEl , NE2 
NEWMODEL : ASN 6 5 : N , CA , C , O , CB , CG , ODl , ND2 
NEWMODEL: GLY 87:N,CA,C,0 
55 NEWM0DEL:ILE 88 :N, CA, C, 0, CB, CG1,CG2 , CDl 

NEWMODEL:GLY 89:N,CA,C,0 

NEWMODEL:LEU 131:N,CA,C,0,CB,CG,CD1,CD2 
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NEWMODELrPRO 173 :N, CA,CD,C,0,CB,CG 
NEWMODEL: ASN 209 : N , CA , C , O , CB , CG , ODl , ND2 
NEWMODEL:ALA 211:N, CA, C,0,CB 
NEWMODEL: ASN 216 :N, CA, C,0, CB, CG,0D1, ND2 
5 NEWMODEL:ASN 217 :N,CA,C,0,CB,CG,0D1,ND2 

NEWMODEL:GLy 218:N,CA,C,0 
NEWMODEL :TYR 
219:N,CA,C,0,CB,CG,CDl,CD2,CEl,CE2,CZ,OH 
NEWMODEL:SER 220:N,CA,C,O,CB,OG 

10 NEWMODEL :TYR 

221:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
NEWMODEL: VAL 232 :N, CA, O, CB, CGI , CG2 
NEWMODEL:ALA 233 :N,CA,C,0,CB 
NEWM0DEL:GLY 262:N,CA,C,0 

15 NEWMODEL :CA E282H:CA 



Example 2 

Suitable substitutions in Savinase® for addition of amino 
20 attachment groups f-NH ?^ 

The known X-ray structure of Savinase® was used to find 
where suitable amino attachment groups may is added (Betzel et 
al, (1992), J. Mol. Biol. 223, p. 427-445). 

The 3D structure of Savinase® is available in the Brookhaven 
25 Databank as Isvn.pbd. A related subtilisin is available as 
lst3.pdb. 

The sequence of Savinase® is shown in SEQ ID NO. 3 
The sequence numbering used is that of subtilisin BPN' , 
Savinase® having deletions relative to BPN» at positions: 36, 
30 56, 158-159 and 163-164. The active site residues (functional 
site) are D32,H64 and S221. 

The commands performed in Insight (BIOSYM) are shown in the 
command files makeKzone.bcl and makeKzone2 .bcl below: 



35 Conservative substitutions: 
makeKzone.bcl 
Delete Subset * 

Color Molecule Atoms * Specified Specification 255,0,255 
Zone Subset LYS :lys:NZ Static monomer/ residue 10 Color_Subset 
40 255,255,0 

Zone Subset NTERM :el:N Static monomer /residue 10 Color_Subset 
255,255,0 

#NOTE: editnextline ACTSITE residues according to the protein 
Zone Subset ACTSITE :e32,e64,e221 Static monomer/ residue 8 
45 Color_Subset 255,255,0 

Combine Subset ALLZONE Union LYS NTERM 

Combine Subset ALLZONE Union ALLZONE ACTSITE 

#NOTE: editnextline object name according to the protein 
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Combine Subset REST Difference SAVI8 ALLZONE 
List Subset REST Atom Output^File restatom. list 
List Subset REST monomer /residue Output_File restmole. list 
Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
5 List Subset ACTSITE Atom Output_File acts iteatom. list 
List Subset ACTSITE monomer /residue Output^File 
actsitemole. list 
# 

Zone Subset REST5A REST Static Monomer/Residue 5 -Color_Subset 
10 Combine Subset SUB5A Difference REST5A ACTSITE 
Combine Subset SUB5B Difference SUB5A REST 

Color Molecule Atoms SUB5B Specified Specification 255,255,255 
List Subset SUB5B Atom Output_File sub5batom. list 
List Subset SUB5B monomer /residue Output_File sub5bmole. list 
15 #Now identify sites for lys->arg substitutions and continue 
with makezone2,bcl 

#Use grep command to identify ARG in restatom. list, 
sub5batom. list & aces iteatom. list 

20 Comments: 

In this case of Savinase® REST contains the Arginines ArglO, 
Argl70 and Arg 186, and SUB5B contains Argl9, Arg45, Argl45 and 
Arg247. 

These residues are all solvent exposed. The substitutions 
25 RIOK, R19K, R45K, R145K, R170K, R186K and R247K are identified 
in Savinase® as sites for mutagenesis within the scope of this 
invention. The residues are substituted below in section 2, 
and further analysis done. The subset ACTSITE contains Lys94. 
The substitution K94R is a mutation removing Lysine as 
30 attachment group close to the active site. 



Non-conservative substitutions: 
makeKzone2 .bcl 

#sourcefile makezone2.bcl Claus Von der Osten 961128 
35 # 

#having scanned lists (grep arg command) and identified sites 
for lys->arg substitutions 

#NOTE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace SAVI8 newmodel 
40 Biopolymer 

#NOTE: editnextline object name according to protein 
Blank Object On SAVI8 

#NOTE: editnextlines with lys->arg positions 

Replace Residue newmodel :el0 lys L 
45 Replace Residue newmodel:el70 lys L 

Replace Residue newmodel :el86 lys L 

Replace Residue newmodel :el9 lys L 

Replace Residue newmodel :e45 lys L 

Replace Residue newmodel :el45 lys L 
50 Replace Residue newmodel :e241 lys L 
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# 

#Now repeat analysis done prior to arg->lys, now including 
introduced lysines 

Color Molecule Atoms newmodel Specified Specification 255,0,255 
5 Zone Subset LYSx newmodel: lysrNZ Static monomer/ residue 10 
Color_Subset 255,255,0 

Zone Subset NTERMx newmodel: el: N Static monomer /residue 10 
Color_Subset 255,255,0 

#NOTE: editnextline ACTSITEx residues according to the protein 
10 Zone Subset ACTSITEx newmodel :e3 2, e64,e2 21 Static 

monomer /residue 8 Color_Subset 255,255,0 

Combine Subset ALLZONEx Union LYSx NTERMx 

Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 

Combine Subset RESTx Difference newmodel ALLZONEx 
15 List Subset RESTx Atom Output File res txatom. list 

List subset RESTx monomer/resTdue Output_File restxmole. list 

# 

Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 
List Subset ACTSITEx Atom Output File act sitexatom. list 
20 List Subset ACTSITEx monomer /residue Output_File 
actsitexmole. list 
# 

#read restxatom.list or restxmole. list to identify sites for 
(not_arg) ->lys subst. if needed 

25 

Comments: 

Of the residues in RESTx, the following are >5% exposed (see 

lists below): 5,14,22,38-40,42,75-76,82,86,103-105,108,133- 

135 , 137 , 140 , 173 ,204 ,206 ,211-213 , 215-216 , 269 . The following 

30 mutations are proposed in Savinase®: P5K, P14K, T22K, T38K, 

H39K, P40K, L42K, L75K, N76K, L82K, P86K, S103K, V104K, S105K, 

A108K, A133K, T134K, L135K, Q137K, N140K, N173K, N204K, Q206K, 

G211K, S212K, T213K, A215K, S216K, N269K- 

Relevant data for Example 2 : 

35 Solvent accessibility data for SAVINASE®: 

# SAVI8NOH20 Fri NOV 29 13:32:07 MET 1996 





# residue 


area 




ALA 1 


118.362808 




GLN 2 


49.422764 


40 


SER 3 


61.982887 




VAL 4 


71.620255 




PRO 5 


21.737535 




TRP 6 


58.718731 




GLY 7 


4.328117 


45 


ILE 8 


6.664074 




SER 9 


60.175900 




ARG 10 


70.928963 




VAL 11 


2.686934 




GLN 12 


72.839996 


50 


ALA 13 


0.000000 




PRO 14 


52.308453 




ALA 15 


38.300892 




ALA 16 


0.000000 
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HIS_17 
ASN_18 
ARG_19 
GLY_20 
5 LEU_21 
THR_22 
GLY_23 
SER_24 
GLY_25 
10 VAL_26 
LYS_27 
VAL_28 
AIiA_29 
VAL_30 
15 LEU_31 
ASP_32 
THR_33 
GLY_34 
ILE_35 
20 SER_3 6 
THR__37 
HIS_38 
PRO_39 
ASP_40 
25 LEU_41 
ASN_42 
ILE_43 
ARG_44 
GLY_45 
30 GLY_46 
ALA_47 
SER_48 
PHE_49 
VAL_50 
35 PR0_51 
GLY_52 
GLU_53 
PRO_54 
SER_55 
40 THR_56 
GLN_57 
ASP_58 
GLY_59 
ASN_60 
45 GLY_61 
HIS_62 
GLY_63 
THR_64 
HIS_65 
. 50 VAL_66 
ALA_67 
GLY_68 
THR_69 
ILE_70 
55 ALA_71 
ALA_72 
LEU 73 



41.826324 

136.376602 

105.678642 

48.231510 

17.196377 

36.781742 

0. 000000 

64.151276 

50.269905 

4.030401 

54.239555 

0.000000 

0.000000 

3.572827 

0.233495 

1.074774 

1.973557 

3.638052 

8.044439 

8.514903 

122.598907 

18.834011 

76.570526 

0. 000000 

19.684013 

88.870216 

56.117710 

110.647194 

26.935413 

35.515778 

21.495472 

34.876190 

52.647541 

23.364208 

110.408752 

80.282906 

43.033707 

124.444336 

60.284889 

47.103241 

120.803505 

12.784743 

61.742443 

56.760231 

1.576962 

38.590118 

0.000000 

0.537387 

0.968253 

1.612160 

0.000000 

2.801945 

9.074596 

0.000000 

4.577205 

0.000000 

47.290039 
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ASN_74 
ASN_75 
SER_76 
ILE_77 
5 GLY_78 
VAL_79 
LEU_80 
GLY_81 
VAL_82 

10 AIiA_83 
PRO_84 
SER_85 
ALA_86 
GLU_87 

15 LEU_88 
TYR_89 
ALA_90 
VAL_91 
LYS_92 

20 VAL_93 
LEU_94 
GLY_95 
ALA_96 
SER_97 

25 GLY_98 
SER_99 
GLY_100 
SER_101 
VAL_102 

30 SER_103 
SER_104 
ILE_105 
ALA_106 
GLN_107 

35 GLY_108 
LEU_109 
GLU_110 
TRP_111 
ALA_112 

40 GLY_113 
ASN_114 
ASN_115 
GLY_116 
MET_117 

45 HIS_118 
VAL_119 
ALA_120 
ASN_121 
LEU_122 

50 SER_123 
LEU_124 
GLY_125 
SER_126 
PRO_127 

55 SER_128 
PRO_129 
SER 130 



PCT/DK9R/00046 
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102.187248 

60.210400 

84.614494 

66.098572 

17.979534 

5.642561 

13.025185 

0.000000 

0.268693 

0.000000 

18.193810 

56.839039 

13.075745 

37.011765 

2.149547 

30.633518 

I. 343467 
0.779450 
5.862781 
0.466991 
10.747736 
8.707102 
41.414677 
96.066040 
33.374485 
67.664116 
35.571117 
54.096992 
52.695324 
62.929684 
8.683097 
15.852910 
14.509443 
94.463066 
0.000000 
0.537387 
63.227707 
55.500740 
0.502189 

II. 908267 
107.208527 
78.811234 
41.453194 
9.634291 
54. 022118 
5.105174 
0.268693 
0.233495 
0.537387 
4.004620 
21.927265 
55.952454 
40.241180 
107.409439 
57.988609 
85.021118 
20.460915 
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ALA 


131 


57.404362 




THR 


132 


74.438805 




leu"" 


133 


12.091203 




GLU 


134 


73.382019 


5 


GLN 


135 


114.870010 




ALA 


136 


2.122917 




VAL 


137 


1.074774 




ASN 


138 


55.622704 




SER 


139 


29.174965 


10 


ALA" 


140 


0.268693 




THR 


141 


27.962946 




SER 


142 


87.263145 




arg" 


143 


88.201218 




gly" 


144 


38.477882 


15 


VAL 


145 


2. 079151 




LEU" 


'146 


13.703363 




VAL 


147 


2.690253 




val" 


"148 


1.074774 




ALA" 


"149 


0.000000 


20 


ALA" 


"150 


4.356600 




SER 


"151 


0.000000 




gly" 


"152 


12.628590 




ASjf 


"153 


84.248703 




SER 


"154 


77.662354 


25 


gly" 


"155 


25.409861 




ALA" 


"156 


38.074570 




gly" 


"157 


40.493744 




SER" 


"158 


53.915291 




ile" 


'159 


4.352278 


30 


ser" 


"160 


12.458543 




tyr" 


'161 


29.670284 




pro" 


'162 


4.030401 




ala' 


"163 


0.968253 




ARG 


"164 


84.059120 


35 


tyr" 


'165 


28.641129 




ALA" 


'166 


68.193314 




ASN" 


"167 


61.686481 




ala" 


"168 


0.537387 




met" 


"169 


0.586837 


40 


ala" 


"170 


0.000000 




val' 


"171 


0. 000000 




gly" 


"l72 


0.000000 




ala" 


"173 


0.933982 




THR" 


'174 


3.013133 


45 


asp' 


'175 


34.551376 




gln" 


"176 


96.873039 




ASN' 


"177 


98.664368 




ASN 


"178 


41.197159 




ASN 


"179 


60.263512 


50 


arg' 


"180 


64.416336 




ala" 


"181 


7.254722 




ser" 


"182 


91.590881 




phe" 


"l83 


52.126518 




ser" 


"184 


2.101459 


55 


gln" 


"185 


15.736279 




TYR 


186 


44.287792 




gly" 


187 


5.114592 
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ALA 


188 


69.406563 




GLY 


189 


36.926083 




LEU 


190 


16.511177 




ASP 


191 


7.705349 


5 


ILE 


192 


0.268693 




VAL 


193 


4.299094 




ALA 


194 


0.000000 




pro" 


195 


0.806080 




gly"^ 


196 


0.000000 


10 


val" 


197 


25.257177 




ASN~ 


198 


82.177422 




VAL*" 


199 


10.747736 




GLN_ 


'200 


80.374527 




SER 


"2 01 


2.008755 


15 


thr" 


202 


0.000000 




TYR 


203 


80.679886 




PRO 


204 


34.632195 




gly"" 


205 


74.536827 




SER 


"206 


74.964920 


20 


THR 


"207 


57.070065 




tyr" 


208 


82.895500 




ALA" 


'209 


22.838940 




SER"" 


210 


69.045639 




leu" 


"211 


49.708279 


25 


asn" 


*212 


86.905457 




gly;; 


"213 


2.686934 




thr" 


"214 


4.669909 




ser" 


"215 


15.225292 




met" 


"216 


7.261287 


30 


ALA^ 


"217 


0.000000 




thr" 


"218 


0.000000 




pro" 


"219 


0.806080 




his" 


"220 


2.662697 




val" 


^221 


0.268693 


35 


ALA" 


"222 


0.000000 




gly" 


"223 


0.000000 




ALA" 


"224 


7.206634 




ALA" 


"225 


1.039576 




ALA" 


"226 


0.268693 


40 


leu' 


"227 


1.074774 




val' 


"228 


1.541764 




lys' 


"229 


39.262505 




gln" 


"230 


54.501614 




lys" 


"231 


81.154129 


45 


asn" 


"232 


30.004124 




pro" 


"233 


91.917931 




ser' 


"234 


102.856705 




TRP' 


"235 


64.639481 




ser" 


"236 


51.797619 


50 


ASN 


237 


24.866917 




val" 


"238 


78.458466 




gln" 


■"239 


73.981461 




ile' 


"240 


14.474245 




arc' 


"241 


41.242931 


55 


asn 


"242 


64.644814 




his' 


"243 


50.671440 




leu" 


"244 


5.127482 
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LYS 


245 


48.820000 




asn" 


"246 


115.264534 




thr" 


"247 


22.205376 




ALA" 


"248 


16.415077 


5 


thr" 


'249 


60.503101 




ser" 


"250 


74.511597 




leu" 


"251 


48.861599 




gly" 


"252 


39.124340 




ser" 


"253 


49.811481 


10 


thr" 


"254 


88.421982 




asn" 


'255 


72.490181 




leu" 


'256 


54.835758 




tyr" 


"257 


38.798912 




gly" 


"258 


3.620916 


15 


ser' 


"259 


35.017368 




gly" 


^260 


0.537387 




leu' 


"261 


8.598188 




val" 


"262 


4.519700 




asn" 


"263 


16.763659 


20 


ALA" 


"2 64 


3.413124 




GLU' 


"265 


37.942276 




ALA" 


"266 


15.871746 




ALA" 


"267 


3.947115 




thr" 


"268 


2.475746 


25 


arg" 


"269 


176.743362 




ion" 


"270 


0.000000 




ion" 


""271 


5.197493 




Subset REST: 



restmole. list 



3 0 g^^ y^ g^^ I^EST * 

SAVI8 : E5-E15 , E17-E18 , E22 , E38-E40 , E42-E43 , E73-E76 , E82-E86 , E103- 
E105, 

SAVI8 : E108-E109 , E111-E112 , E115-E116 , E122 , E128-E144 , E149- 
E150,E156-E157, 

35 SAVI8 : E160-E162 , E165-E168 , E170-E17 1 , E173 , E180-E188 , E190- 
E192,E200, 

SAVI8 : E203-E204 , E206 , E211-E213 , E215-E216, E227-E230 , E255- 
E259,E261-E262, 
SAVI8:E267-E269 
40 restatoin. list 
Subset REST: 

SAVI8:PR0 E5:N,CD,CA,CG,CB,C,0 

SAVI8:TRP E6:N,CA,CD2,CE2,NE1,CD1,CG,CE3,CZ3,CH2,CZ2,CB,C,0 
SAVI8:GLY E7:N,CA,C,0 
45 SAVI8:ILE E8 :N, CA, GDI, CGI, CB, CG2 , C,0 
SAVI8:SER E9:N,CA,0G,CB,C,0 

SAVI8:ARG ElO :N,CA,NH2 ,NH1, CZ, NE, CD,CG, CB, C, O 
SAVI8:VAL E11:N, CA, CG2 ,CG1 , CB, C,0 
SAVI8:GLN E12:N,CA,NE2,OEl,CD,CG,CB,C,0 
50 SAVI8:ALA E13 :N, CA, CB, C,0 

SAVI8:PR0 E14:N,CD,CA,CG,CB,C,0 
SAVI8:ALA E15:N,CA,CB,C,0 

SAVI8:HIS E17 :N, CA, CD2 ,NE2 , CEl ,ND1 , CG,CB, C,0 
SAVI8 : ASN E18 : N , CA , ND2 , ODl , CG , CB , C , O 
55 SAVI8:THR E22 :N,CA,CG2 ,0G1,CB, C, 0 
SAVI8:THR E38:N,CA,CG2,0G1,CB,C,0 
SAVI8:HIS E39:N,CA,CD2,NE2,CE1,MD1,CG,CB,C,0 
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SAVI8: 


PRO 


E40:N,CD,CA,CG,CB,C,0 




SAVI8: 


LEU 


E42:N,CA,CD2,CD1,CG,CB,C,0 




SAVI8 ; 


ASN 


E43:N,CA,ND2,0D1,CG,CB,C,0 




SAVI8 : 


ALA 


E73:N,CA,CB,C,0 


5 


SAVI8 : 


ALA 


E74:N,CA,CB,C,0 




SAVI8 : 


LEU 


E75:N,CA,CD2,CD1,CG,CB,C,0 




SAVI8 : 


ASN 


E76:N,CA,ND2,0D1,CG,CB,C,0 




SAVI8 : 


LEU 


E82:N,CA,CD2,CD1,CG,CB,C,0 




SAVI8: 


GLY 


E83:N,CA,C,0 


10 


SAVI8: 


VAL 


E8 4 : N , CA , CG2 , CGI , CB , C , 0 




SAVI8: 


ALA 


E85:N,CA,CB,C,0 




SAVI8: 


PRO 


E86:N,CD,CA,CG,CB,C,0 




SAVI8 : 


SER 


E103: 


N,CA,OG,CB,C,0 




SAVI8 : 


VAL 


E104: 


N,CA,CG2 ,CG1,CB,C,0 


15 


SAVI8: 


SER 


E105: 


.N,CA,0G,CB,C,0 




SAVI8: 


ALA 


E108: 


N,CA,CB,C,0 




SAVI8: 


GLN 


E109: 


N,CA,NE2,0E1,CD,CG,CB,C,0 




SAVI8 : 


LEU 


Elll: 


N,CA,CD2,CD1,CG,CB,C,0 




SAVI8: 


GLU 


E112: 


N,CA,0E2,0E1,CD,CG,CB,C,0 


20 


SAVIB: 


GLY 


E115: 


N,CA,C,0 




SAVI8 : 


ASN 


E116: 


N,CA,ND2,0D1,CG,CB,C,0 




SAVI8 : 


ALA 


E122: 


N,CA,CB,C,0 




S AVI 8 1 


SER 


E128: 


N,CA,OG,CB,C,0 




SAVI8: 


PRO 


E129: 


.N,CD,CA,CG,CB,C,0 


25 


SAVI8 : 


SER 


E130: 


;N,CA,OG,CB,C,0 




SAVI8 J 


PRO 


E131: 


:N,CD,CA,CG,CB,C,0 




SAVI8 J 


SER 


E132: 


;N,CA,OG,CB,C,0 




SAVI8: 


ALA 


E133: 


:N,CA,CB,C,0 




SAVI8 : 


:THR 


E134: 


:N,CA,CG2,0G1,CB,C,0 


30 


SAVI8: 


:LEU 


E135: 


;N,CA,CD2,CD1,CG,CB,C,0 




SAVI8: 


:GLU 


E136: 


:N,CA,0E2,0E1,CD,CG,CB,C,0 




SAVI8: 


:GLN 


E137: 


:N,CA,NE2,0E1,CD,CG,CB,C,0 




SAVI8 : 


:ALA 


E138: 


:N,CA,CB,C,0 




SAVI8 ' 


:VAL 


E139J 


:N,CA,CG2,CG1,CB,C,0 


35 


SAVI8 


:ASN 


E140J 


:N,CA,ND2,0Dl,CG,CB,C,O 




SAVI8: 


:SER 


E141: 


:N,CA,OG,CB,C,0 




SAVI8 


:ALA 


E142J 


:N,CA,CB,C,0 




SAVI8 


:THR 


E143' 


jN,CA,CG2,0G1,CB,C,0 




SAVI8 


:SER 


E144 3 


:N,CA,OG,CB,C,0 


40 


SAVI8 


:VAL 


E149: 


:N,CA,CG2,CG1,CB,C,0. 




SAVI8 


:VAL 


E150 


:N,CA,CG2,CG1,CB,C,0 




SAVI8 


:SER 


E156 


:N,CA,OG,CB,C,0 




SAVI8 


:GLY 


E157 


:N,CA,C,0 




SAVI8 


;ALA 


E160 


:N,CA,CB,C,0 


45 


SAVI8 


:GLY 


E161 


:N,CA,C,0 




SAVI8 


:SER 


E162 


:N,CA,OG,CB,C,0 




SAVI8 


:ILE 


E165 


:N,CA,CD1,CG1,CB,CG2,C,0 




SAVI8 


:SER 


E166 


:N,CA,OG,CB,C,0 




SAVI8 


:TYR 


E167 


:N,CA,0H,CZ,CD2 ,CE2 , CEl , GDI , CG, CB, C,0 


50 


SAVI8 


:PRO 


E168 


:N,CD,CA,CG,CB,C,0 




SAVI8 


:ARG 


E170 


:N,CA,NH2,NH1,CZ,NE,CD,CG,CB,C,0 




SAVI8 


:TYR 


E171:N,CA,0H,CZ,CD2,CE2,CE1,CD1^CG,CB,C, 




SAVI8 


:ASN 


E173 


:N,CA,ND2,0D1, CG,CB,C,0 




SAVI8 


:THR 


E180:N,CA,CG2,OG1,CB,C,0 


55 


SAVI8 


:ASP 


E181 


:N,CA,OD2,ODl,CG,CB,C,0 




SAVI8 


:GLN 


E182 


:N,CA,NE2,0E1,CD,CG,CB,C,0 




SAVI8 


:ASN 


E183 


:N,CA,ND2,0D1,CG,CB,C,0 
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SAVI8: 


ASN 


E184: 


N, 


CA,ND2,0D1,CG,CB,C,0 






SAVI8: 


ASN 


E185: 


N, 


CA,ND2,0D1,CG,CB,C,0 






SAVI8: 


ARG 


E186: 


N, 


CA,NH2,NH1,CZ,NE,CD,CG,CB,C, 


0 




SAVI8 : 


ALA 


E187: 


N| 


CA,CB,C,0 




5 


SAVI8 : 


SER 


E188: 


N| 


CA,OG,CB,C,0 






SAVI8: 


SER 


E190: 


:N, 


CA,OG,CB,C,0 






SAVI8: 


GLN 


E191J 


:N, 


CA,NE2,0E1,CD,CG,CB,C,0 






SAVI8 : 


TYR 


E192: 


;N, 


CA,0H,CZ,CD2^CE2,CE1,CD1,CG, 


CB,C,0 




SAVI8 : 


ALA 


E200; 


:N, 


CA,CB,C,0 




10 


SAVI8 : 


VAL 


E203: 


:N, 


CA,CG2,CG1,CB,C,0 






SAVI8: 


ASN 


E204J 


:N, 


CA,ND2,0D1,CG,CB,C,0 






SAVI8: 


GLN 


E206: 


iN, 


CA,NE2,0E1,CD,CG,CB,C,0 






SAVI8: 


IGLY 


E211: 


:N, 


CA,C,0 






SAVI8 : 


SER 


E212: 


:N, 


CA,OG,CB,C,0 




15 


SAVI8: 


THR 


E213 


:N, 


.CA,CG2,0G1,CB,C,0 






SAVI8; 


ALA 


E215 


:N, 


CA,CB,C,0 






SAVI8I 


:SER 


E216: 


:N, 


.CA,OG,CB,C,0 






SAVI8: 


;VAL 


E227: 


:N, 


,CA,CG2,CG1,CB,C,0 






SAVI8: 


ALA 


E228 


:N, 


pCA,CB,C,0 




20 


SAVI8: 


.GLY 


E229 


:N, 


,CA,C,0 






SAVI8: 


:ALA 


E230 


:N, 


,CA,CB,C,0 






SAVI8 


:THR 


E255, 


:N, 


,CA,CG2,0G1,CB,C,0 






SAVI8, 


:S£R 


E256, 


'N, 


,CA,OG,CB,C,0 






SAVI8 ; 


:LEU 


E257 


:N, 


,CA,CD2,CD1,CG,CB,C,0 




25 


SAVI8 ' 


:GLY 


E258 


:N, 


pCA,C,0 






SAVI8 


:SER 


E259 


:N, 


,CA,OG,CB,C,0 






SAVI8 


:ASN 


E261 


:N, 


,CA,ND2,0D1,CG,CB,C,0 






SAVI8 


:LEU 


E262 


:N, 


,CA,CD2,CD1,CG,CB,C,0 






SAVI8 


:LEU 


E267 


:N 


,CA,CD2,CD1,CG,CB,C,0 




30 


SAVI8 


:VAL 


E268 


:N, 


,CA,CG2,CG1,CB,C,0 






SAVI8 


:ASN 


E269 


:N 


,CA,ND2,0D1,CG,CB,C,0 





Subset SUB5B: 

subSbmole. list 
Subset SUB5B: 

35 SAVI8 : E2-E4 , E16 , E19-E2 1 , E23-E24 , E28 , E37 , E41 , E44-E45 , 
E77-E81,E87-E88, 

SAVI8 : E90 , E113-E114 , E117-E118 , E120-E121, E145- 
E148 , E169 , E172 , E174-E176 , 
SAVI8 : E193-E196 , E198-E199 , E2 14 , E23 1- 
40 E234,E236,E243,E247,E250/E253-E254, 

SAVI8 : E260 , E263-E266, E270-E273 ,M276H-M277H 

subSbatoin. list 
Subset SUB5B: 

SAVI8:GLN E2:N,CA,NE2,0E1,CD,CG,CB,C,0 
45 SAVI8:SER E3:N,CA,0G,CB,C,0 

SAVI8:VAL E4 :N, CA, CG2 , CGI , CB, C, O 
SAVI8:ALA E16 :N, CA, CB, C,0 

SAVI8:ARG E19:N,CA,NH2,NH1,CZ,NE,CD,CG,CB,C,0 

SAVI8:GLY E20:N,CA,C,O 
50 SAVI8:LEU E21:N,CA, CD2 ,CD1,CG,CB,C,0 

SAVI8:GLY E23:N,CA,C,0 

SAVI8:SER E24:N,CA,OG,CB,C,0 

SAVI8:VAL E28:N,CA,CG2,CG1,CB,C,0 

SAVI8:SER E37:N,CA,OG,CB,C,0 
55 SAVI8:ASP E41:N,CA,OD2,ODl,CG,CB,C,0 

S AVI 8 : ILE E4 4 : N , CA , CD 1 , CG 1 , CB , CG2 , C ^ O 

SAVI8:ARG E45:N,CA,NH2,NH1,CZ,NE,CD,CG,CB,C,0 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



SAVI8 
SAVI8 
SAVI8 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 
SAVIB 



ASN E77:N,CA,ND2,0D1,CG,CB,C,0 

SER E78:N,CA,OG,CB,C,0 

ILE E79:N,CA,CD1,CG1,CB,CG2,C,0 

GLY EB0:N,CA,C,O 

VAL EB1:N,CA,CG2,CG1,CB,C,0 

SER EB7:N,CA,0G,CB,C,0 

ALA EB8:N,CA,CB,C,0 

LEU E90:N^CA,CD2,CD1,CG,CB,C,0 



TRP E113 
ALA E114 
ASN E117 
GLY EllB 
HIS E120 
VAL E121 
ARG E145 
GLY E146 
VAL E147 
LEU E14B 
ALA E169 
ALA E172 
ALA E174 
MET E175 
ALA E176 
GLY E19 3 
ALA E194 
GLY E195 
LEU E196 
ILE E198 
VAL E199 
TYR E214 
ALA E231 
ALA E232 
LEU E23 3 
VAL E234 
GLN E236 
ASN E243 
ARG E247 
LEU E250 
THR E253 
ALA E254 
THR E260 
TYR E263 



:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


rCA, 


:N, 


rCA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


CA, 


:N, 


rCA, 


:N, 


CA, 


:N. 


CA, 


:N, 


rCA, 


:N, 


rCA, 


:N, 


rCA, 


:N, 


rCA, 


:N, 


rCA, 


:N, 


rCA, 


:N, 


rCA, 


:N 


rCA, 


:N, 


rCA, 


:N 


pCA, 




rCA, 



,CD2,CE2,NE1,CD1,CG,CE3,CZ3,CH2,CZ2,CB,C,0 
,CB,C,0 

,ND2,0D1,CG,CB,C,0 
,C,0 

,CD2,NE2,CE1,ND1,CG,CB,C,0 
,CG2,CG1,CB,C,0 
,NH2,NH1,CZ,NE,CD,CG,CB,C,0 
,C,0 

,CG2,CG1,CB,C,0 
,CD2,CD1,CG,CB,C,0 
,CB,C,0 
,CB,C,0 
,CB,C,0 

,CE,SD,CG,CB,C,0 
,CB,C,0 
,C,0 
,CB,C,0 
,C,0 

,CD2,CD1,CG,CB,C,0 
, CDl , CGI , CB, CG2 , C, O 
,CG2,CG1,CB,C,0 

, OH , CZ , CD2 , CE2 , CEl , CD 1 , CG , CB , C , O 
,CB,C,0 
,CB,C,0 

,CD2,CD1,CG,CB,C,0 
,CG2,CG1,CB,C,0 
,NE2,0E1,CD,CG,CB,C,0 
,ND2,0D1,CG,CB,C,0 
,NH2,NH1,CZ,NE,CD,CG,CB,C,0 
,CD2,CD1,CG,CB,C,0 
,CG2,0G1,CB,C,0 
,CB,C,0 

,CG2,0G1,CB,C,0 

,0H,CZ,CD2,CE2,CE1,CD1,CG,CB,C,0 

,c,o 

,OG,CB,C,0 
,C,0 
,CB,C,0 

,OE2,OEl,CD,CG,CB,C,0 
,CB,C,0 
,CB,C,0 



55 



GLY E264! 
SER E265: 
GLY E266; 
ALA E270: 
GLU E271J 
ALA E272; 
ALA E273: 
ION M276H:CA 
ION M277H:CA 
Subset ACTSITE: 

actsitemole. list 
Subset ACTSITE: 

SAVIB : E29-E35 , E48-E51 , E54 , E58-E72 , E91-E102 , E106-E107 , EllO , E123- 
E127, 
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SAVI8 : E151-E155 , E177-E179 , E189 , E201-E202 , E205 , E207-E2 10 , E2 17- 
E226 

actsiteatom. list 



5 


Subset ACTSITE: 






SAVI8: 


ALA 


E29: 


N 




SAVI8 : 


VAL 


E30: 


N 




SAVI8 : 


LEU 


E31: 


N 




SAVI8: 


ASP 


£32: 


N 


10 


SAVI8: 


THR 


E33: 


N 




SAVI8: 


GLY 


E34: 


N 




SAVI8: 


ILE 


E35: 


N 




SAVI8: 


ALA 


E48: 


N 




SAVI8 : 


SER 


E49i 


N 


15 


SAVI8 : 


PHE 


E50: 


N 




SAVI8 : 


VAL 


E5l! 


N 




SAVI8 : 


6LU 


E54: 


,N 




SAVI8: 


THR 


E58: 


:N 




SAVI8: 


GLN 


E59! 


•N 


20 


SAVI8 : 


ASP 


E60: 


N 




SAVI8 : 


GLY 


E61: 


.N 




SAVI8 : 


ASN 


E62: 


:N 




SAVI8 : 


GLY 


E63: 


:N 




SAVI8 : 


HIS 


E64: 


:N 


25 


SAVI8 : 


GLY 


E65: 


:N 




SAVI8 : 


THR 


E66< 


;N 




SAVI8 : 


HIS 


E67 


:N 




SAVI8 : 


VAL 


E68 


:N 




SAVI8 : 


,ALA 


E69 


:N 


30 


SAVI8 : 


.GLY 


E70 


;N 




SAVI8! 


THR 


E71 


:N 




SAVI8 ; 


:ILE 


E72 


:N 




SAVI8 ; 


:TYR 


E91 


:N 




SAVI8 1 


:ALA 


E92 


:N 


35 


SAVI8! 


iVAL 


E93 


:N 




SAVI8 ! 


:LYS 


E94 


:N 




SAVI8 


:VAL 


E95 


:N 




SAVI8 , 


:LEU 


E96 


:N 




SAVI8 


:GLY 


E97 


:N 


40 


SAVI8 


:ALA 


E98 


:N 




SAVI8 


:SER 


E99 


:N 




SAVI8 


:GLY 


ElOO: 




SAVI8 


:SER 


ElOl: 




SAVI8 


:GLY 


E102: 


45 


SAVI8 


:SER 


E106: 




SAV18 


:IL£ 


E107: 




SAVI8 


:6LY 


EllO: 




SAVI8 


:ASN 


E123: 




SAVIB 


:LED 


E124: 


50 


SAVI8 


:SER 


E125: 




SAVI8 


:LEU 


E126: 




SAVI8 


:GLY 


E127: 




SAVI8 


:ALA 


E151: 




SAVI8 


:ALA 


E152: 


55 


SAVI8 


:SER 


E153: 




SAVI8 


:GLY 


E154: 




SAVI8 


:ASN 


E155: 



,CA,OD2,ODl,CG,CB,C, 
,CA,C,0 

,CA,ND2,0D1,CG,CB,C,0 
,CA,C,0 

,CA,CD2,NE2,CE1,ND1,CG,CB,C,0 
,CA,C,0 

,CA,CG2,0G1,CB,C,0 
CA,CD2,NE2,CE1,ND1,CG,CB,C,0 
CA,CG2,CG1,CB,C,0 
CA,CB,C,0 
CA,C,0 

CA,CG2,OGl,CB,C,0 
CA , CDl , CG 1 , CB , CG2 , C , O 
CA,0H,CZ,CD2,CE2,CE1,CD1,CG,CB,C,( 
^,CB,C,0 



CA,CB,C,0 
,CA,OG,CB,C,0 
N,CA,C,0 

N,CA,ND2,0D1,CG,CB,C,( 
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SAVI8:VAL E177:N,CA,CG2,CG1,CB,C,0 
SAVI8:GLY E178 :N,CA, C,0 
SAVI8:ALA E179 :N, CA, CB, C, O 

SAVI8:PHE E189:N,CA,CD2,CE2,CZ, CEl, CDl, CG,CB,C,0 
5 SAVI8:PR0 E201:N,CD,CA,CG,CB,C,0 

SAVI8:GLY E202:N,CA,C,O 

SAVI8:VAL E205:N,CA,CG2,CG1,CB,C,0 

SAVI8:SER E207:N,CA,OG, CB,C,0 

SAVI8:THR E208:N,CA,CG2,OG1,CB,C,0 
10 SAVI8:TYR E209:N,CA,OH,CZ,CD2,CE2,CE1,CD1,CG,CB,C,0 

SAVI8:PR0 E210 :N, CD, CA, CG, CB, C,0 

SAVI8:LEU E217:N,CA,CD2,CD1,CG,CB,C,0 

SAVI8:ASN E218:N,CA,ND2,0D1,CG,CB,C,0 

SAVI8:GLY E219:N,CA,C,0 
15 SAVI8:THR E220 :N, CA, CG2 , OGl , CB, C, O 

SAVI8:SER E221 : N, CA, OG, CB, O 

SAVI8:MET E222 : N, CA, CE, SD, CG, CB,C, O 

SAVI8:ALA E223:N,CA,CB,C,0 

SAVI8:THR E224 :N , CA, CG2 ,0G1, CB, C, O 
20 SAVI8:PR0 E225 :N, CD, CA, CG, CB, C, 0 

SAVI8:HIS E226:N,CA,CD2,NE2,CE1,ND1,CG,CB,C,0 
Subset RESTx: 

restxmole. list 
Subset RESTX: 
25 NEWMODEL: E5 , E13-E14 , E22 , E38-E40 , 

E42,E73-E76,E82-E86,E103-E105, 

NEWMODEL: E108 , E122 , E133-E135 , E137-E140^ 
E149-E150,E173,E204,E206, 

NEWMODEL: E2 11-E2 13 , E215-E216 , E227- E229 , 
30 E258,E269 
restxatom. list 
Subset RESTX: 

NEWMODEL: PRO E5 :N, CD, CA, CG, CB, C, O 

NEWMODEL:ALA E13 :N,CA,CB, C,0 
35 NEWMODEL:PRO E14:N,CD,CA,CG,CB,C,0 

NEWMODEL:THR E22 :N, CA, CG2 ,0G1 ,CB, C, 0 

NEWMODEL: THR E38 : N, CA, CG2 ,0G1 , CB , C, O 

NEWMODEL:HIS E39:N,CA,CD2,NE2,CE1,ND1,CG,CB,C,0 

NEWMODEL:PRO E40:N,CD,CA,CG,CB,C,0 
40 NEWM0DEL:LEU E42 :N,CA,CD2 ,CD1,PG,CB,C,0 

NEWMODEL: ALA E73 :N,CA,CB, C,0 

NEWMODEL:ALA E74 :N, CA, CB, C, O 

NEWMODEL: LEU E75 : N , CA , CD2 , CDl , CG , CB , C , O 

NEWMODEL: ASN E76 : N , CA, ND2 , ODl , CG, CB , C , O 
45 NEWMODEL:LEU E82 :N, CA, CD2 , CD1,CG, CB,C,0 

NEWMODEL:GLY E83:N,CA,C,0 

NEWMODEL: VAL E84:N,CA,CG2,CG1,CB,C,0 

NEWMODEL:ALA E85:N,CA,CB, C,0 

NEWM0DEL:PR0 E86:N,CD,CA,CG,CB,C,0 
50 NEWMODEL:SER E103 :N, CA, OG, CB, C, 0 

NEWMODEL:VAL E104:N,CA,CG2,CGl,CB,C,O 

NEWMODEL:SER E105:N,CA,OG,CB,C,0 

NEWMODEL:ALA E108:N,CA,CB,C,0 

NEWMODEL:ALA E122 :N,CA,CB,C,0 
55 NEWMODEL:ALA E133 :N, CA, CB, C, O 

NEWMODEL : THR E134 : N , CA , CG2 , OGl , CB , C , O 

NEWMODEL : LEU El 3 5 : N , CA , CD 2 , CD 1 , CG , CB , C , O 
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10 



15 



20 



NEWMODEL:GLN E137 
NEWMODEL:ALA E138 
NEWMODEL:VAL El 3 9 
NEWMODEL:ASN E140 
NEWMODELrVAL E149 
NEWMODEL:VAL E150 
NEWMODEL:ASN E173 
NEWMODEL:ASN E204 
NEWMODEL:GLN E206 
NEWMODELlGLY E211 
NEWMODEL:SER E212 
NEWMODEL:THR E213 
NEWMODEL:ALA E215 
NEWMODELrSER E216 
NEWMODELzVAL E227 
NEWMODELrALA E228 
NEWMODELrGLY E229 
NEWMODELrGLY E258 
NEWMODEL:ASN E269 



.11, V^,V/ 

:N,CA,OG,CB,C,0 
:N,CA,CG2,0G1,CB,C,0 
:N,CA,CB,C,0 
:N,CA,OG,CB,C,0 
:N,CA,CG2,CG1,CB,C,0 
:N,CA,CB,C,0 
:N,CA,C,0 
:N,CA,C,0 

•N,CA,ND2,0D1,CG,CB,C,0 



Example 3 

Suitable substitutions in PD498 for addition of carboxvlic acid 
attachment groups f-COOH) 

The 3D structure of PD498 was modeled as described in 
25 Example 1. 

Suitable locations for addition of carboxylic attachment groups 
(Aspartatic acids and Glutamic acids) were found as follows. 
The procedure described in Example 1 was followed. The 
commands performed in Insight (BIOSYM) are shown in the command 
30 files makeDEzone.bcl and makeDEzone2.bcl below: 



Conservative substutitions: 

makeDEzone • bcl 

Delete Subset * 
35 Color Molecule Atoms * Specified Specification 255,0,255 

Zone Subset ASP :asp:od* Static monomer /residue 10 Color_Subset 
255,255,0 

Zone Subset GLU :glu:oe* Static monomer /residue 10 Color_Subset 
255,255,0 

40 #NOTE: editnextline C-terminal residue number according to the 
protein 

Zone Subset CTERM : 280:0 Static monomer /residue 10 Color_Subset 
255,255,0 

#NOTE: editnextline ACTSITE residues according to the protein 
45 Zone Subset ACTSITE : 39, 72, 226 Static monpmer/residue 8 

Color_Subset 255,255,0 

Combine Subset ALLZONE Union ASP GLU 

Combine Subset ALLZONE Union ALLZONE CTERM 

Combine Subset ALLZONE Union ALLZONE ACTSITE 
50 #NOTE: editnextline object name according to the protein 

Combine Subset REST Difference PD498FINALMODEL ALLZONE 
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List Subset REST Atom Output File resta torn. list 
List Subset REST monomer/ residue Output_File restmole.list 
Color Molecule Atoms ACTSITE Specified Specification 255 > 0,0 
List Subset ACTSITE Atom Output File actsiteatom. list 
5 List Subset ACTSITE monomer/ residue Output_File 
actsitemole. list 
# 

Zone Subset REST5A REST Static Monomer/Residue 5 -Color_Subset 
Combine Subset SUB5A Difference REST5A ACTSITE 

10 Combine Subset SUB5B Difference SUB5A REST 

Color Molecule Atoms SUB5B Specified Specification 255,255,255 
List Subset SUB5B Atom Output_File subSba torn. list 
List Subset SUB5B monomer/ residue Output_File subSbmole. list 
#Now identify sites for asn->asp & gln->glu substitutions and 

15 ... 

#continue with make2one2.bcl. 

#Use grep command to identify asn/gln in restatom.list ... 
#sub5ba torn. list & aces i tea tom. list 

20 Comments: 

The subset REST contains Gln33 and Asn245, SUB5B contains 
Glnl2, Glnl26, Asn209, Gln242, Asn246, Gln248 and Asn266, all 
of which are solvent exposed. 

The substitutions Q12E or Q12D, Q33E or Q33D, Q126E or 
25 Q126D, N209D or N209E, Q242E or Q242D, N245D or N245E, N246D or 
N246E, Q248E or Q248D and N266D or N266E are identified in 
PD498 as sites for mutagenesis within the scope of this 
invention. Residues are substituted below in section 2, and 
further analysis done: 



Non-conservative substitutions: 
makeD£zone2 • bcl 

#sourcefile makezone2 .bcl Claus von der Osten 961128 
# 

35 #having scanned lists (grep gln/asn command) and identified 
sites for ... 

#asn->asp & gln->glu substitutions 

#NOTE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace PD498FINALMODEL newmodel 
40 Biopolymer 

#NOTE: editnextline object name according to protein 
Blank Object On PD498FINALMODEL 

#NOTE: editnext lines with asn->asp & gln->glu positions 

Replace Residue newmodel: 33 glu L 
45 Replace Residue newmodel: 245 asp L 

Replace Residue newmodel: 12 glu L 

Replace Residue newmodel: 126 glu L 

Replace Residue newmodel: 209 asp L 

Replace Residue newmodel: 242 glu L 
50 Replace Residue newmodel: 246 asp L 

Replace Residue newmodel: 248 glu L 
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Replace Residue newmodel:266 asp L 
# 

#Now repeat analysis done prior to asn->asp & gln~>glu, ••• 
#now including introduced asp & glu 
5 Color Molecule Atoms newmodel Specified Specification 255,0,255 
Zone Subset ASPx nev7inodel:asp:od* Static monomer /residue 10 
Color__Subset 255,255,0 

Zone Subset GLUx newmodel: glu :oe* Static monomer /residue 10 
Color_Subset 255,255,0 
10 #NOTE: editnextline C-terminal residue number according to the 
protein 

Zone Subset CTERMx newmodel : 280 :0 Static monomer/residue 10 
Color_Subset 255,255,0 

#N0TE: editnextline ACTSITEx residues according to the protein 
15 Zone Subset ACTSITEx newmodel: 39 , 72 , 226 Static monomer/residue 

8 Color_Subset 255,255,0 

Combine Subset ALLZONEx Union ASPx GLUx 

Combine Subset ALLZONEx Union ALLZONEx CTERMx 

Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
20 Combine Subset RESTx Difference newmodel ALLZONEx 

List Subset RESTx Atom Output_FiIe restxatom. list 

List Subset RESTx monomer /residue Output_File restxmole. list 

# 

Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 
25 List Subset ACTSITEx Atom Output File actsitexatom. list 
List Subset ACTSITEx monomer /residue Output_File 
act sitexmole. list 
# 

#read restxatom. list or restxmole. list to identify sites for 
30 (not_gluasp) ->gluasp . . . 
#subst. if needed 

Comments : 

The subset RESTx contains only two residues: A233 and G234, 

35 none of which are solvent exposed. No further mutagenesis is 

required to obtain complete protection of the surface. 

However, it may be necessary to remove some of the reactive 

carboxylic groups in the active site region to ensure access to 

the active site of PD498. Acidic residues within the subset 

40 ACTSITE are: D39, D58, D68 and D106. Of these only the two 

latter are solvent exposed and D39 is a functional residue. The 

mutations D68N, D68Q, D106N and D106Q were found suitable 

according to the present invention. 

Relevant data for Example 3: 

45 Solvent accessibility data for PD498MODEL: see Example 1 above. 

Subset REST: 

restmole. list 
Subset REST: 

PD498FINALMODEL: 10-11, 33-35, 54-55, 129-130, 
50 221,23 3-234,236, 24 0,243, 

PD498FINALMODEL: 245,262 ,264-265 
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restatom. list 
Subset REST: 

PD498FINALMODEL:ALA 10:N,CA,C,0,CB 
5 PD498FINALMODEL:TYR 11 :N, CA, C, 0,CB, CG, CDl, CD2 , CEl, CE2 , CZ,OH 
PD498FINALMODEL:GLN 33 :N,CA, C,0,CB,CG,CD,0E1,NE2 
PD4 9 8 FINALMODEL : THR 3 4 : N , CA , C , O , CB , OGl , CG2 
PD498FINALMODEL:VAL 35:N,CA,C,0,CB,CG1,CG2 
PD4 9 8FINALM0DEL : ILE 54 : N , CA , C , O , CB , CGI , CG2 , GDI 
10 PD498FINALMODEL:LYS 55:N,CA,C,0,CB,CG,CD,CE,NZ 
PD498FINALMODEL:LYS 129 :N, CA, C, 0, CB, CG, CD, CE,NZ 
PD4 9 8 FINALMODEL :VAL 130 :N, CA, C, CB, CG1,CG2 

PD498FINALMODEL:TYR 221:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

PD498FINALMODEL:ALA 233:N,CA,C,0,CB 
15 PD4 9 8 FINALMODEL :GLY 234:N,CA,C,0 

PD498FINALMODEL:ALA 236 :N, CA, C,0, CB 

PD4 9 8 FINALMODEL: ALA 240 :N, CA, C,0, CB 

PD4 9 8 FINALMODEL :GLY 243:N,CA,C,0 

PD4 9 8 FINALMODEL :ASN 245:N,CA,C,0,CB,CG,0D1,ND2 
20 PD4 9 8 FINALMODEL :GLY 262:N,CA,C,0 

PD4 9 8 FINALMODEL :GLY 264:N,CA,C,0 

PD498FINALMODEL:THR 265:N,CA,C,0,CB,0G1,CG2 
Subset SUB5B: 
sub5binole . list 
25 Subset SUB5B: 

PD498FINALMODEL: 6-9 , 12-13 ,31-32 , 51-53 , 56,81, 93-94 , 97- 

99,122,126-128, 

PD498FINALMODEL: 131 , 155-157 , 159 , 197-199 , 209 , 211 , 219- 
220,232,235, 

30 PD498FINALMODEL:237-239, 241-242, 244, 246-249, 253,260- 

261,263,266-268 
sub5batoin. list 

Subset SUB5B: 

PD4 9 8 FINALMODEL : PRO 6 : N , CA , CD , C , 0 , CB , CG 
35 PD498FINALMODEL:TYR 7 :N, CA, C, 0,CB, CG, CDl, CD2 , CEl, CE2 , CZ,OH 

PD4 9 8 FINALMODEL :TYR 8:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

PD4 9 8FINALM0DEL : SER 9 : N , CA , C , 0 , CB , OG 

PD498FINALMODEL:GLN 12 : N, CA, C, 0, CB, CG, CD, OEl ,NE2 

PD498FINALMODEL : TYR 13 : N , CA , C , O , CB , CG , CDl , CD2 , CEl , CE2 , CZ , OH 
40 PD498FINALMODEL:SER 31:N, CA,C,0,CB,,OG 

PD4 9 8 F INALMODEL : THR 3 2 : N , CA , C , O , CB , OG 1 , CG2 

PD498FINALMODEL:ARG 51 : N, CA, C, O, CB, CG, CD, NE, CZ ,NH1 ,NH2 

PD4 9 8 FINALMODEL : LYS 5 2 : N , CA , C , O , CB , CG , CD , CE , NZ 

PD4 98FINALMODEL : VAL 53 : N , CA , C , 0 , CB , CGI , CG2 
45 PD4 9 8 FINALMODEL :GLY 56:N,CA,C,0 

PD498FINALMODEL:ALA 81:N, CA, C,0, CB 

PD4 9 8 FINALMODEL: MET 93 : N, CA, C,0, CB, CG, SD, CE 

PD498FINALMODEL:ALA 94 :N, CA, C,0, CB 

PD4 9 8 FINALMODEL : THR 9 7 : N , CA , C , 0 , CB , OG 1 , CG2 
50 PD498FINALMODEL:LYS 98:N,CA,C,0,CB,CG,CD,CE,NZ 

PD4 9 8FINALM0DEL : ILE 9 9 : N , CA , C , O , CB , CGI , CG2 , CDl 

PD4 98FINALMODEL : TYR 12 2 : N , CA , C , O , CB , CG , CDl , CD2 , CEl , CE2 , CZ , OH 

PD4 9 8 FINALMODEL : GLN 12 6 : N , CA , C , O , CB , CG , CD , OEl , NE2 

PD4 9 8 FINALMODEL :GLY 127:N,CA,C,0 
55 PD4 9 8 FINALMODEL: ALA 128 :N, CA, C, O, CB 

PD498FINALMODEL:LEU 131:N,CA,C,0,CB,CG,CD1,CD2 

PD498FINALMODEL:GLY 155:N,CA,C,0 
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PD498FINALMODEL:ALA 156 
PD498FINALMODEL:VAL 157 
PD498FINALM0DEL:VAL 159 
PD498FINALMODEL:TYR 197 
5 PD498FINALMODEL:GLY 198 
PD498FINALMODEL:THR 199 
PD498FINALMODEL:ASN 2 09 
PD498FINALMODEL:ALA 211 
PD498FINAIiMODEL:TYR 219 
10 PD498FINAIiMODEL:SER 220 
PD498FINA3jMODEL:VAL 232 
PD498FINAIiMODEL:LEU 235 
PD498FINALM0DEL:ALA 237 
PD498FINALMODEL:LEU 238 
15 PD498FINALMODEL:LEU 239 
PD498FINALMODEL:SER 241 
PD498FINALMODEL:GLN 24 2 
PD498FINALMODEL:LYS 244 
PD498FINALMODEL:ASN 246 
20 PD498FINALMODEL:VAL 247 
PD4 98 FINALMODEL : GLN 248 
PD498FINALMODEL:ILE 249 
PD498FINALMODEL:ILE 253 
PD4 9 8 FINALMODEL :ILE 260 
25 PD4 9 8 FINALMODEL :SER 261 
PD498FINALM0DEL:THR 263 
PD4 9 8 FINALMODEL :ASN 266 
PD498FINALMODEL:PHE 267 
PD498FINALMODEL:LYS 268 
30 Subset ACTSITE: 

actsitemole. list 
Subset ACTSITE: 

PD498FINALMODEL: 3 6-42, 57-60, 66-80, 100-110, 
115-116,119,132-13 6,160-164, 
35 PD498FINALMODEL: 182-184 , 194 , 206-207 , 210 , 

212-215,222-231 
actsiteatom. list 
Subset ACTSITE: 

PD498FINALMODEL:ALA 36:N,CA,C,0,GB 
40 PD4 9 8 FINALMODEL :VAL 37 :N, CA, C, 0, CB, CGI, CG2 

PD4 9 8 FINALMODEL: LEU 38 : N, CA, C , O , CB , CG , CDl , CD2 
PD4 9 8 FINALMODEL : ASP 3 9 : N , CA , C , 0 , CB , CG , ODl , 0D2 
PD4 9 8 FINALMODEL : SER 4 0 : N , CA , C , O , CB ,,0G 
PD498FINALM0DEL:GLY 4l:N,CA,C,0 
45 PD4 9 8 FINALMODEL :VAL 42:N,CA,C,0,CB,CG1,CG2 

PD4 9 8 FINALMODEL : TYR 

57:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
PD4 9 8 FINALMODEL : ASP 5 8 : N , CA , C , O , CB , CG , ODl , 0D2 
PD4 9 8 F INALMODEL : PHE 
50 59:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

PD498FINALMODEL: ILE 60 : N , CA, C , O , CB, CGI , CG2 , CDl 
PD498FINALMODEL: PRO 66 : N , CA , CD , C , O , CB , CG 
PD498FINALMODEL:MET 67 : N , CA , C , O , CB , CG , SD , CE 
PD4 9 8 FINALMODEL : ASP 68 : N , CA , C , O , CB , CG , ODl , 0D2 
55 PD4 9 8 FINALMODEL: LEU 69 :N, CA, C,0, CB, CG,CD1 ,CD2 

PD4 9 8 FINALMODEL : ASN 7 0 : N , CA , C , O , CB , CG , OD 1 , ND2 
PD498FINALMODEL:GLY 71:N,CA,C,0 





ca) 




0, 


CB,CG1,CG2 








N, 


CA, 


c, 


0, 


CB,CG1,CG2 








N, 


CA, 


c, 


0, 


CB , CG , CDl , CD2 , CEl , 


CE2 


,CZ, 


OH 


N 


CA, 


c, 


0 










N 


CA, 




0, 


CB,0G1,CG2 








N 


CA, 


c 


0, 


CB,CG,0D1,ND2 








TJ 




C 


o 


CB 












n 


o 


CB CG - CDl . CD2 * CEl . 


CE2 


,cz. 


OH 






p 


J 


CB OG 








n 1 


CA 


n 
v.* , 


n 


CB CGI CG2 










CA 


p 




PR CG CnT.CD2 










PA 


p 


n 

f ^ 1 


CB 








N 


CA 


Ci 


r 0| 


CB , CG , CDl , CD2 








N 


CA 


c 


r 0, 


CB, CG, CDl, CD2 








N 


CA, 


Q 




CB,OG 








N. 


CA, 


Cj 


fO, 


, CB , CG , CD , OEl , NE2 








Nj 


CA, 


Cj 


rO, 


,CB,CG,CD,CE,NZ 








Ni 


r CA, 


Cj 


fOi 


,CB,CG,0D1,ND2 








N 


CA, 




fOj 


f CB , CGI , CG2 








N 


f CA, 


fCj 


fO] 


, CB , CG , CD , OEl , NE2 










rCA, 




fO 


,CB,CG1,CG2,CD1 










rCA, 


pC, 


rO 


,CB,CG1,CG2,CD1 








N| 


fCA, 


rC 


fO 


,CB,CG1,CG2,CD1 








Ni 


rCA, 


pC 


fO, 


,CB,OG 








N 


rCA, 


rC 


rO, 


,CB,0G1,CG2 








N 


rCA, 


rC 


rO 


^CB,CG,0D1,ND2 








N 


rCA, 


fC 


rO 


, CB , CG , CDl , CD2 , CEl , 


CE2 


,cz 




N 


,CA 


fC 


fO 


,CB,CG,CD,CE,NZ 
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PD4 9 8FINALMODEL : HIS 72 : N , CA , C , O , CB , CG , NDl , CD2 , CEl ; NE2 
PD498FINALMODEL:GLY 73:N,CA,C,0 
PD498FINALMODEL:THR 74 : N, CA, C, 0, CB, OGl , CG2 
PD498FINALMODEL:HIS 75:N,CA,C,0,CB;CG,ND1,CD2,CE1,NE2 
5 PD498FINALMODEL:VAL 76:N,CA,C,0,CB,CG1,CG2 

PD498FINALMODEL:ALA 77 :N, CA, C, 0, CB 
PD498FINALMODEL:GLY 78:N,CA,C,0 
PD4 9 8 FINALMODEL : THR 7 9 : N , CA , C , O , CB , OGl , CG2 
PD4 9 8 FINALMODEL : VAL 8 0 : N , CA , C , O , CB , CGI , CG2 



10 


PD4 9 8 FINALMODEL : LEU 


100: 




CA, 


C,0,CB, 


CG, 


CDl , CD2 




PD4 9 8 FINALMODEL : ALA 


101: 




CA, 


C,0,CB 








PD4 9 8 FINALMODEL : VAL 


102: 




CA, 


C,0,CB, 


CGI 


,CG2 




PD4 9 8FINALMODEL : ARG 


103:N, 


CA, 


C,0,CB, 








CG,CD,NE,CZ,NH1 


,NH2 












15 


PD4 9 8FINALM0DEL : VAL 


104: 


N, 


CA, 


C,0,CB, 


CGI 


,CG2 




PD4 9 8FINALM0DEL : LEU 


105: 


N/ 


CA, 


C,0,CB, 


CG, 


CD1,CD2 




PD4 9 8FINALM0DEL : ASP 


106: 


N, 


CA, 


C,0,CB, 


CG, 


0D1,0D2 




PD4 9 8FINALM0DEL : ALA 


107: 


N, 


CA, 


C,0,CB 








PD4 9 8 FINALMODEL : ASN 


108: 


N, 


CA, 


C,0,CB, 


CG, 


0D1,ND2 


20 


PD498FINALMODEL: GLY 


109: 


N, 


CA, 


C,0 








PD498FINALMODEL: SER 


110: 


N, 


CA, 


C,0,CB, 


OG 






PD498FINALMODEL: SER 


115: 


N, 


CA, 


C,0,CB, 


OG 






PD4 9 8 FINALMODEL: ILE 


116: 


N, 


CA, 


C,0,CB, 








CG1,CG2;CD1 














25 


PD498FINAIiM0DEL: GLY 


119: 


N, 


CA, 


C,0 








PD4 9 8 FINALMODEL : ASN 


132: 


N, 


CA, 


C,0,CB, 


CG, 


0D1,ND2 




PD4 9 8 FINALMODEL : LEU 


133: 


N, 


CA, 


C,0,CB, 


CG, 


CD1,CD2 




PD498FINALMODEL: SER 


134: 


N, 


CA, 


C,0,CB, 


OG 






PD4 9 8 FINALMODEL : LEU 


135: 


N, 


CA, 


C,0,CB, 


CG, 


CD1,CD2 


30 


PD4 9 8 FINALMODEL : GLY 


136: 


N| 


CA, 


c,o 








PD4 9 8 FINALMODEL : ALA 


160: 


N, 


CA, 


C,0,CB 








PD4 9 8 FINALMODEL : ALA 


161: 


N, 


CA, 


C,0,CB 








PD4 9 8 FINALMODEL : ALA 


162: 


N, 


CA, 


C,0,CB 








PD4 9 8 FINALMODEL : GLY 


163: 


N, 


CA, 


C,0 






35 


PD4 9 8 FINALMODEL : ASN 


164: 


N, 


pCA, 


C,0,CB, 


CG, 


0D1,ND2 




PD4 9 8 FINALMODEL : VAL 


182: 


.N, 


rCA, 


C,0,CB, 


CGI 


,CG2 




PD4 9 8 FINALMODEL : GLY 


183: 


N, 


.CA, 


rC,0 








PD4 9 8 FINALMODEL : ALA 


184: 


:N, 


pCA, 


,C,0,CB 








PD498FINALMODEL:PHE 


194: 


:N, 


rCA, 


,C,0,CB, 






40 


CG,CD1,CD2,CE1, 


CE2, 


CZ 












PD4 9 8 FINALMODEL : PRO 


206: 


:N, 


rCA, 


,CD,C,0, 


CB, 


CG 




PD4 9 8 FINALMODEL : GLY 


207; 


;N, 


,CA 


fC,0 








PD498FINALMODEL: ILE 


210:N, 


rCA, 


pC,0,CB, 








CGI , CG2 , CDl 














45 


PD4 9 8 FINALMODEL: SER 


212 


:N 


,CA 


^C,0,CB, 


OG 






PD498FINALMODEL : THR 


213 


:N 


,CA 


,C,0,CB, 


OGl 


.,CG2 




PD4 9 8FINALM0DEL : VAL 


214 


:N 


,CA 


,C,0,CB, 


CGI 


.,CG2 




PD4 9 8 FINALMODEL : PRO 


215 


:N 


,CA 


^ CD , C , 0 , 


CB, 


CG 




PD4 9 8 FINALMODEL : MET 


222 


:N 


,CA 


,C,0,CB, 


CG, 


SD,CE 


50 


PD4 9 8 FINALMODEL : S ER 


223 


:N 


,CA 


,C,0,CB, 


OG 






PD4 9 8 FINALMODEL : GLY 


224 


:N 


,CA 


fC,0 








PD4 9 8FINALM0DEL : THR 


225 


:N 


,CA 


,C,0,CB, 


0G1,CG2 




PD498FINALM.ODEL: SER 


226 


:N 


,CA 


,C,0,CB, 


OG 






PD4 9 8FINALM0DEL : MET 


227 


:N 


,CA 


,C,0,CB, 


CG, 


SD,CE 


55 


PD4 98 FINALMODEL : ALA 


228 


:N 


,CA 


,C,0,CB 








PD4 9 8 FINALMODEL: SER 


229 


:N 


,CA 


,C,0,CB, 


OG 






PD4 9 8 FINALMODEL: PRO 


230 


:N 


,CA 


, CD , C , 0 , 


CB, 


CG 



- wo 98/35026 



PCT/DK98/00046 



73 

PD498FINALMODEL:HIS 231:N,CA,C,0,CB, 
CG , NDl , CD2 , CEl , NE2 
Subset RESTx: 

restxmole. list 
5 Subset RESTX: 

NEWMODEL: 233-234 
restxatom. list 
Subset RESTX: 

NEWMODEL:ALA 233:N,CA,C,0,CB 
10 NEWMODEL:GLY 234:N,CA,C,0 

Example 4 

Suitable substitutions in the Arthromyces ramosus peroxidase 
for addition of carbbxvlic acid attachment groups (-COOH) 
15 Suitable locations for addition of carboxylic attachment 
groups (Aspartatic acids and Glutamic acids) in a non- 
hydrolytic enzyme, Arthromyces ramosus peroxidase were found as 
follows • 

The 3D structure of this oxido-reductase is available in the 
20 Brookhaven Databank as larp.pdb. This A. ramosus peroxidase 
contains 344 amino acid residues. The first eight residues are 
not visible in the X-ray structure: QGPGGGGG, and N143 is 
glycosylated. 

The procedure described in Example 1 was followed. 
25 The amino acid sequence of Arthromyces ramosus Peroxidase 

(E.C.1.11.1.7) is shown in SEQ ID NO 4. 

The commands performed in Insight (BIOSYM) are shown in the 

command files makeDEzone.bcl and makeDEzone2 . bcl below. The C- 

terminal residue is P344, the ACTSITE is defined as the heme 
30 group and the two histidines coordinating it (H56 & H184) . 

Conservative substitutions: 

makeDEzone • bcl 

Delete Subset * 

Color Molecule Atoms * Specified Specification 255,0,255 
35 Zone Subset ASP :asp:od* Static monomer /residue 10 Color_Subset 
255,255,0 

Zone Subset GLU :glu:oe* Static monomer/ residue 10 Color_Subset 
255,255,0 

#NOTE: editnextline C-terminal residue number according to the 
40 protein 

Zone Subset CTERM : 344:0 Static monomer /residue 10 Color_Subset 
255,255,0 

#NOTE: editnextline ACTSITE residues according to the protein 
Zone Subset ACTSITE :HEM,56,184 Static monomer /residue 8 
45 Color_Subset 255,255,0 

Combine Subset ALLZONE Union ASP GLU 
Combine Subset ALLZONE Union ALLZONE CTERM 
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Combine Subset ALLZONE Union ALLZONE ACTSITE 
#NOTE: editnextline object name according to the protein 
Combine Subset REST Difference ARP ALLZONE 
List Subset REST Atom Output File restatom. list 
5 List Subset REST monomer/residue Output_File restmole. list 
Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
List Subset ACTSITE Atom Output File actsiteatom, list 
List Subset ACTSITE monomer /residue Output_File 
actsitemole. list 
10 # 

Zone Subset REST5A REST Static Monomer/Residue .5 -Color_Subset 
Combine Subset SUB5A Difference REST5A ACTSITE 
Combine Subset SUB5B Difference SUB5A REST 

Color Molecule Atoms SUB5B Specified Specification 255,255,255 
15 List Subset SUB5B Atom Output_File sub5batom. list 

List Subset SUB5B monomer /residue Output_File subSbmole. list 
#Now identify sites for asn->asp & gln->glu substitutions and 
• • « 

#continue with ma)cezone2.bcl. 
20 #Use grep command to identify asn/gln in restatom. list 
#sub5batom. list & accsiteatom. list 

Comments : 

The subset REST contains Gln70, and SUB5B contains Gln34, 
25 Asnl28, Asn303 all of which are solvent exposed. The 

substitutions Q34E or Q34D, Q70E or Q70D, N128D or N128E and 
N303D or N303E are identified in A. ramosus peroxidase as sites 
for mutagenesis. Residues are substituted below and further 
analysis done: 

30 

Non-conservative substitutions: 
makeD£zone2 . bcl 

#sourcefile makezone2 .bcl Claus von der Osten 961128 
# 

35 #having scanned lists (grep gln/asn command) and identified 
sites for . . . 

#asn->asp & gln->glu substitutions 

#NOTE: editnextline object name according to protein 
Copy Object -To_Clipboard -Displace ARP newmodel 
40 Biopolymer 

#NOTE: editnextline object name according to protein 
Blank Object On ARP 

#NOTE: editnextlines with asn->asp & gln->glu positions 
Replace Residue newmodel: 34 glu L 
45 Replace Residue newmodel: 70 glu L 
Replace Residue newmodel: 128 asp L 
Replace Residue newmodel: 303 asp L 
# 

#Now repeat analysis done prior to asn->asp & gln->glu, ... 
50 #now including introduced asp & glu 

Color Molecule Atoms newmodel Specified Specification 255,0,255 
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Zone Subset ASPx newmodel:asp:od* Static monomer /residue 10 
Color_Subset 255,255^0 

Zone Subset GLUx newmodel:glu:oe* Static monomer/residue 10 
Color_Subset 255,255,0 
5 #NOTE: editnextline C-terminal residue number according to the 
protein 

Zone Subset CTERMx newmodel: 344 :0 Static monomer /residue 10 
Color_Subset 255,255,0 

#NOTE: editnextline ACTSITEx residues according to the protein 
10 Zone Subset ACTSITEx newmodel: HEM, 56, 184 Static monomer/residue 

8 Color_Subset 255,255,0 

Combine Subset ALLZONEx Union ASPx GLUx 

Combine Subset ALLZONEx Union ALLZONEx CTERMx 

Combine Subset ALLZONEx Union ALLZONEx ACTSITEx 
15 Combine Subset RESTx Difference newmodel ALLZONEx 

List Subset RESTx Atom Output File restxatom. list 

List Subset RESTx monomer/residue Output_File restxmole. list 

# 

Color Molecule Atoms ACTSITEx Specified Specification 255,0,0 
20 List Subset ACTSITEx Atom Output File actsitexatom. list 
List Subset ACTSITEx monomer/ residue Output_File 
actsitexmole . list 
# 

#read restxatom. list or restxmole. list to identify sites for 
25 (not_gluasp) ->gluasp - . . 
#subst. if needed 

Comments : 

The subset RESTx contains only four residues: S9, S334, G335 

30 and P336, all of which are >5% solvent exposed. The mutations 
S9D, S9E, S334D, S334E, G335D, G335E, P336D and P336E are 
proposed in A. ramosus peroxidase. Acidic residues within the 
subset ACTSITE are: E44, D57, D77, E87, E176, D179, E190, D202, 
D209, D246 and the N-terminal carboxylic acid on P344. Of these 

35 only E44, D77, E17 6, D179, E190, D209, D246 and the N-terminal 
carboxylic acid on P344 are solvent exposed. Suitable sites for 
mutations are E44Q, D77N, E176Q, D179N, E190Q, D209N and D246N. 
D246N and D246E are risky mutations due to D246's importance 
for binding of heme. 

40 The N-terminal 8 residues were not included in the 

calculations above, as they do not appear in the structure. 
None of these 8 residues, QGPGGGG, contain carboxylic groups. 
The following variants are proposed as possible mutations to 
enable attachment to this region: QlE, QID, G2E, G2D, P3E, P3D, 

45 G4E, G4D, G5E, G5D, G6E, G6D, G7E, G7D, G8E, G8D. 
Relevant data for Example 4: 
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Solvent accessibility data for A. ramosus peroxidase (Note: 
as the first eight residues are missing in the X-ray structure, 
the residue nximbers printed in the accessibility list below are 
8 lower than those used elsewhere for residue numbering • 
5 # ARP Thu Jan 30 15:39:05 MET 1997 





# residue 


area 




SER 


1 


143.698257 




val" 


2 


54.879990 




thr"" 


■3 


86.932701 


10 


CYS" 


■4 


8.303715 




pro" 


■5 


126.854782 




gly" 


"6 


53.771488 




GLY 


■7 


48.137802 




GUf 


"8 


62.288475 


15 


SER 


'9 


79.932549 




thr" 


"lO 


16.299215 




ser" 


"11 


81.928642 




ASN 


"12 


51.432678 




ser" 


13 


81.993019 


20 


GLN 


14 


92.344009 




CYS" 


"15 


0.000000 




CYS" 


"16 


32.317432 




val" 


"17 


54.067810 




TRP" 


"18 


6.451035 


25 


phe" 


"19 


25.852070 




asp" 


"20 


79.033997 




val' 


"21 


0.268693 




leu" 


'22 


22.032858 




asp" 


"23 


90.111404 


30 


asp" 


"24 


43.993240 




leu" 


"25 


1.074774 




GLN" 


"26 


25.589321 




thr" 


'27 


82.698059 




asn" 


"28 


96.600883 


35 


phe" 


"29 


32.375275 




tyr" 


"30 


5.898365 




GLN 


"31 


103.380585 




gly" 


'32 


40.042034 






■33 


46.789322 


40 


LYS 


34 


87.161873 




CYS" 


■35 


12 .827215 




GLU" 


"36 


51.582657 




SER" 


"37 


16.378180 




pro" 


"38 


33.560043 


45 


val" 


■39 


6.448641 




arg" 


"40 


7.068311 




lys" 


"41 


15.291286 




ile" 


"42 


1.612160 




leu" 


"43 


1.880854 


50 


arg" 


"44 


16.906845 




ile" 


■45 


0.000000 




val" 


"46 


2.312647 




phe"" 


"47 


2.955627 




his" 


"48 


20.392527 


55 


asp" 


"49 


4.238116 
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ALA 


50 


0.510757 






51 


1-576962 




gly" 


52 


2.858601 




PHE 


"53 


48.633503 


5 


ser" 


54 


8.973248 




pro" 


"55 


58.822315 




ALA 


56 


59.782852 




LEU^ 


57 


46.483955 




THR 58 


86.744827 


10 


ALA 59 


89.515816 




ALA 


60 


81.163239 




gly" 


'61 


70.119019 




gln" 


'62 


112.635498 




PHE 


"63 


93.522354 


15 


gly" 


"64 


2.742587 




gly" 


"65 


13.379636 




GLY 


66 


22.722847 




gly" 


"67 


0.000000 




ALA" 


"68 


0.268693 


20 


asp" 


"69 


12.074840 




gly 


"70 


0.700486 




ser" 


'71 


0.000000 




ile" 


72 


0.000000 




ile" 


'73 


0.000000 


25 


ala" 


'74 


17.304443 




his" 


"75 


41.071186 




ser" 


"76 


20.000793 




asn" 


"77 


120.855316 




ile" 


'78 


66.574982 


30 


GLU" 


"79 


2.334954 




leu" 


"80 


41.329689 




ala' 


"81 


77.370575 




PHE 


"82 


38.758774 




pro" 


"83 


131.946289 


35 


ala" 


"84 


34.893864 




asn" 


"85 


5.457000 




gly" 


"86 


43.364151 




GLY 


"87 


51.561348 




LEU^ 


>8 


0.242063 


40 


THR 


89 


73.343575 




asp" 


"90 


130.139389 




thr" 


"91 


17.863211 




ile" 


"92 


0.268693 




GLU" 


"93 


92.210396 


45 


ALA" 


"94 


35.445068 




leu" 


"95 


1.343467 




arg" 


"96 


31.175611 




ala" 


"97 


44.650192 




val" 


"98 


17.698566 


50 


gly" 


"99 


1 . 471369 




ile' 


"lOO 


62.441463 




asn" 


"lOl 


107.139748 




his" 


"102 


46.952496 




gly" 


"103 


46.559296 


55 


val' 


"l04 


11.342628 




ser' 


"105 


15.225677 




phe" 


"106 


6.422011 
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GLY 


107 


3,426864 




ASP 


108 


10.740790 




LEU 


109 


0.268693 




ILE 


110 


1.880854 


5 


GLN 


111 


31.867456 




PHE 


112 


0.000000 




ALA 


113 


0.000000 




THR 


114 


3.656114 




ALA 


115 


8.299393 


10 


val" 


116 


0.268693 




GLY 


117 


0.268693 




MET 


118 


3.761708 




SER 


119 


14.536770 




ASN 


120 


25.928799 


15 


CYS 


"121 


0.537387 




PRO 


122 


29.798336 




GLY" 


123 


33.080013 




SER" 


124 


17.115562 




pro" 


"125 


36.908714 


20 


ARG 


"126 


108.274727 




LEU" 


'127 


21.238588 




GLU" 


"128 


53.742313 




PHE 


"129 


3.761708 




LEU 


'13 0 


12.928699 


25 


thr" 


"131 


10.414591 




GLY' 


"132 


47.266495 




arg" 


"l33 


12 .247048 




SER- 


"134 


63 .047237 




ASN" 


"135 


31.403708 


30 


SER^ 


"136 


97.999619 




SER*" 


"137 


28.505201 




GLN" 


"138 


102.845520 




PRO^ 


"139 


49.691917 




ser" 


"140 


9.423104 


35 


pro" 


"141 


25.724171 




pro" 


"142 


80.706665 




ser" 


"143 


105.318176 




leu" 


"144 


20.154398 




ile" 


"145 


41.288322 


40 


pro" 


"146 


10.462679 




gly' 


147 


19.803421 




pro" 


148 


18.130360 




GLY' 


"149 


47.391853 




ASN 


"150 


60.248917 


45 


thr" 


"151 


87.887985 




val" 


"152 


13.870322 




thr" 


"153 


74.664734 




ALA" 


154 


45.251106 




ile" 


"155 


2.686934 


50 


leu 


"156 


28.720940 




asp" 


"157 


110.081253 




ARG 


"158 


31.228874 




MET 


159 


1.612160 




GLY" 


160 


38.223858 


55 


asp' 


"161 


46.293152 




ALA" 


162 


9.877204 




GLY" 


163 


34.267326 
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PHE_ 


164 


11.057570 




ser" 


165 


51.158882 




PRO 


166 


62.767738 




asp"" 


167 


75.164917 


5 


GLU 


168 


43.334976 




VAL 


169 


6.365355 




val" 


170 


2.955627 




asp" 


171 


7.004863 




LEU 


172 


1.880854 


10 


leu" 


173 


3.197691 




ALA 


174 


0.000000 




ALA 


175 


1.074774 




his" 


176 


0.502189 




SER 


177 


0.806080 


15 


leu" 


178 


3.197691 




ALA 


179 


3.337480 




SER 


180 


0.466991 




gln" 


181 


2.122917 




GLU" 


182 


40.996552 


20 


GLY 


183 


62.098671 




LEU 


'184 


23.954853 




ASN 


185 


15.918136 




SER- 


'186 


95.185318 




ALA" 


'187 


59.075272 


25 


ILE 


'188 


27.675419 




phe" 


189 


102.799423 




arg" 


'190 


55.265549 




ser" 


"191 


6.986028 




pro" 


"192 


2.686934 


30 


leu" 


'193 


12.321225 




asp" 


"194 


2.127163 




ser" 


"195 


33.556419 




thr" 


"196 


33.049286 




pro" 


'197 


20.874798 


35 


gln" 


'198 


65.729698 




val" 


"199 


31.705818 




phe" 


"200 


4.753195 




asp" 


"201 


13.744506 




thr" 


"202 


1.612160 


40 


gln" 


"203 


16.081930 




phe" 


"204 


2.581340 




TYR 


"205 


1.880854 




ile" 


"206 


9.356181 




GLU^ 


"207 


0.735684 


45 


thr" 


"208 


10.685907 




leu" 


"209 


9.672962 




leu" 


"210 


2.955627 




LYS" 


'211 


77.176834 




gly" 


"212 


40.968609 


50 




Z 1 J 


Jo. 71o2Xd 




THR- 


"214 


21.738384 




GLN 


"215 


77.622299 




pro" 


'216 


25.441587 




gly" 


"217 


8.320850 


55 


pro" 


218 


96.972305 




ser" 


"219 


64.627823 




leu" 


"220 


85.732414 
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GLY 221 


27.361111 




PHE 222 


134.620178 




ALA 


223 


3.873014 




GLU 


224 


12.141763 


5 


GLU" 


225 


65.129868 




leu" 


'226 


76.105843 




ser" 


227 


0.268693 




pro"" 


228 


7.017754 




phe" 


229 


0.000000 


10 


pro" 


230 


47.827423 




gly" 


"231 


23.790522 




GLU" 


232 


6.643466 




PHE 


233 


6.713862 




ARG 


234 


18.012030 


15 


met"" 


235 


4.598188 




ARG 


"236 


91.415581 




ser"" 


'237 


1.982125 




asp" 


'238 


6.246871 




ALA" 


'239 


12.897283 


20 


LEU 


'240 


76.820526 




LEU 


'241 


3.224321 




ALA"" 


'242 


1.400973 




arg" 


'243 


77.207176 




asp" 


"244 


36.207306 


25 


ser"" 


"245 


104.023796 




ARG 


"246 


121.852341 




thr" 


'247 


2.955627 




ALA" 


"248 


4.810700 




CYS" 


"249 


47.331306 


30 


arg]; 


"250 


62.062778 




trp" 


"251 


2.418241 




gln"" 


"252 


5.554953 




ser" 


"253 


38.284832 




met" 


"254 


1.124224 


35 


thr" 


"255 


0.000000 




ser" 


"256 


53.758987 




ser" 


"257 


37.276134 




asn" 


"258 


44.381340 




glu" 


"259 


149.565140 


40 


val" 


"260 


57.500389 




met' 


261 


2.679314 




gly" 


262 


10.175152 




gln" 


"263 


107.458916 




arg" 


"264 


36.402130 


45 


tyr" 


"265 


0.233495 




arg' 


"266 


91.179619 




ALA" 


2267 


53.708500 




ALA 




6.504294 




MET 


269 


17.122011 


50 


ALA" 


270 


22 . 455158 




LYS' 


271 


73.386177 




MET 


"272 


3.959508 




ser" 


"273 


15.043281 




VAL 


"274 


23 .887930 


55 


LEU 


"275 


17.196379 




gly' 


276 


44.362202 




PHE 


277 


68.062485 
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ASP 


278 


94.902039 




arg" 


279 


113.549011 




ASN 


"280 


134,886017 




ALA 


"281 


72.340973 


5 


LEU 


"282 


26.692348 




thr" 


"283 


27.696728 




ASP 


"284 


72.214157 




CYS]] 


"285 


0.000000 




ser" 


286 


28.209335 


10 


asp"" 


287 


64.560753 




val" 


'288 


7.040061 




ILE 


"289 


8.665112 




pro" 


"290 


48.682365 




ser" 


291 


86.141670 


15 


ALA" 


"292 


29.031240 




val" 


'293 


84.432014 




ser" 


"294 


85.944153 




asn" 


'295 


49.017288 




asn" 


'296 


133.459198 


20 


ALA"" 


'297 


57.283794 




ALA' 


"298 


65.233749 




PRO 299 


24.751518 




VAL 300 


45.409184 




ILE 


301 


8.060802 


25 


pro" 


'302 


14.742939 




gly" 


[303 


16.589832 




GLY 


304 


34.238071 




leu" 


"305 


24.719791 




thr" 


"306 


49.356300 


30 


val" 


"307 


71.491821 




asp" 


"308 


130.906174 




asp" 


"309 


31.733070 




ile" 


"310 


19.581894 




GLU" 


"311 


81.414574 


35 


val" 


"312 


94.769890 




ser" 


"313 


39.688896 




CYS" 


"314 


9.998511 




pro" 


"315 


120.328018 




ser" 


"316 


95.364319 


40 


GLU" 


"317 


65.560959 




pro" 


"318 


100.254364 




phe" 


"319 


46.284115 




pro" 


"320 


31.328060 




GLU" 


"321 


177.602249 


45 


ILE" 


"322 


33.449741 




ALA" 


"323 


46.892982 




thr" 


"324 


79.976471 




ALA" 


"325 


36.423820 




ser" 


"326 


124.467422 


50 


gly" 


"327 


28.219524 




pro" 


"328 


107.553696 




leu" 


"329 


86.789825 




pro" 


"330 


34.287163 




ser" 


"331 


75.764053 


55 


LEU 


"332 


32.840569 




ALA" 


"333 


61.516434 




pro" 


"334 


82.389992 
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6.246871 

56.750813 

60.435017 

2.078997 

0.000000 

141.534668 

186.311371 



ALA_335 
PRO_33 6 
HEM_337 
CA_338 
5 CA_339 
NAG_340 
NAG_341 
Subset REST: 

restiaole.list 
10 Subset REST: 

ARP: 9, 69-70, 125, 127, 133,299-301,334-336 
restatom. list 
Subset REST: 

ARP:SER 9:N,CA,C,0,CB,0G 
15 ARP:GLY 69:N,CA,C,0 

ARP:GLN 70 :N, CA, C, O, CB, CG, CD,0E1,NE2 
ARP:GLY 125:N,CA,C,0 
ARP:SER 127:N,CA,C,0,CB,OG 
ARP:PRO 133:N,CA,CD,C,0,CB,CG 
20 ARP:SER 299 :N,CA,C,0,CB,OG 

ARP:ALA 300:N,CA,C,O,CB 
ARP:VAL 301:N,CA,C,O,CB,CGl,CG2 
ARP:SER 334:N,CA,C,0,CB,OG 
ARP:GLY 335:N,CA,C,0 
25 ARP:PRO 336:N,CA,CD,C,0,CB,CG 

Subset SUB5B: 

subSbmole . list 
Subset SUB5B: 

ARP: 10-11 ,34,38, 65-68 ,71-72 , 120-121 , 123-124 , 
30 128-132,134,270,274, 

ARP: 297-298 ^302-303, 311-312, 332-333, 337-338 

sub5batoin. list 
Subset SUB5B: 

ARPrVAL 10:N,CA,C,O,CB,CGl,CG2 
35 ARP:THR 11:N,CA,C,0,CB,0G1,CG2 

ARP:GLN 34:N,CA,C,0,CB,CG,CD,0E1,NE2 

ARP:TYR 38:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

ARP:LEU 65:N,CA,C,0,CB,CG,CD1,CD2 

ARPiTHR 66:N,CA,C,0,CB,0G1,CG2 
40 ARP:ALA 67 : N, CA, C, O, CB 

ARP:ALA 68:N,CA,C,0,CB 

ARP:PHE 71:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
ARP:GLY 72:N,CA,C,0 



ARP:PHE 120 
45 ARP:ALA 121 

ARP:ALA 123 
ARP:VAL 124 
ARP:ASN 128 
ARP:CYS 129 
50 ARP:PRO 130 

ARP:GLY 131 
ARP:SER 132 
ARP:ARG 134 
ARP:GLY 270 
55 ARP:ARG 274 

ARP:ILE 297 
ARP: PRO 298 



N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
N,CA,C,0,CB 
N,CA,C,0,CB 
N,CA,C,0,CB,CG1,CG2 
N,CA,C,0,CB,CG,0D1,ND2 
N,CA,C,0,CB,SG 
N,CA,CD,C,0,CB,CG 
N,CA,C,0 
N,CA,C,0,CB,OG 

N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
N,CA,C,0 

N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
N , CA , C , 0 , CB , CG 1 , CG2 , CD 1 
N,CA,CD,C,0,CB,CG 
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ARPrSER 302 :N, CA, O, CB , OG 
ARPrASN 303 :N, CA, C, 0, CB, CG, ODl ,ND2 
ARPiGLY 311:N,CA,C,0 
ARP:GLY 312:N,CA,C,0 
5 ARPtTHR 332:N,CA,C,0,CB,OGl,CG2 

ARPrALA 333 :N,CA,C,0,CB 
ARP:LEU 337:N,CA,C,0,CB,CG,CD1,CD2 
ARP:PRO 338:N,CA^CD,C,0,CB^CG 
Subset ACTSITE: 
10 actsitemole* list 
Subset ACTSITE: 

ARP: 44-61, 75-77, 79-80, 87-88, 90-96, 

99 , 118 , 122 , 126 , 135 , 148-149 , 152-158 , 
ARP: 163-164 , 167 , 176-194 , 197-205 , 207-209 ,211- 
15 213,216,230-231,241, 

ARP: 243-246, 249, 259, 273, 277, 280, 343-347H 
acts i t eat om • 1 i s t 



Subset ACTSITE 



20 



25 



30 



35 



40 



45 



50 



55 



ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 
ARP 



GLU 44 
SER 45 
PRO 4 6 
VAL 47 
ARG 48 
LYS 49 
ILE 50 
LEU 51 
ARG 52 
ILE 53 
VAL 54 
PHE 55 
HIS 
ASP 
ALA 
ILE 
GLY 
PHE 
GLY 
ALA 
ASP 77 
SER 79 
ILE 80 
GLU 
LEU 
PHE 
PRO 

ALA 92 
ASN 93 
GLY 94 
GLY 95 
LEU 96 
THR 99 



56 
57 
58 
59 
60 
61 
75 
76 



87 
88 
90 
91 



,CA 
fCA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
f CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 
,CA 



0,CB,CG,CD,0E1,0E2 



0,CB,0G 
CD,C,0,CB,CG 
0,CB,CG1,CG2 

0,CB,CG,CD,NE,CZ,NH1,NH2 
0,CB,CG,CD,CE,NZ 
0,CB,CG1,CG2,CD1 
0,CB,CG,CD1,CD2 
0,CB,CG,CD,NE,CZ,NH1,NH2 
0,CB,CG1,CG2,CD1 
0,CB,CG1,CG2 

O , CB , CG , CDl , CD2 , CEl , CE2 , CZ 
O , CB , CG , ND 1 , CD2 , CEl , NE2 
0,CB,CG,0D1,0D2 
0,CB 

0,CB,CG1,CG2,CD1 
O 

0,CB,CG,CD1,CD2,CE1,CE2,CZ 
O 

0,CB 

0,CB,CG,0D1,0D2 
0,CB,0G 

0,CB,CG1,CG2,CD1 
0,CB,CG,CD,0E1,0E2 
0,CB,CG,CD1,CD2 
O , CB , CG , CDl , CD2 , CEl , CE2 , CZ 
CD,C,0,CB,CG 
0,CB 

0,CB;CG,0D1,ND2 

O 
0 

0,CB,CG,CD1,CD2 
0,CB,0G1,CG2 
ILE 118:N,CA,C,0,CB,CG1,CG2,CD1 
THR 122:N,CA,C,0,CB,0G1,CG2 
MET 126:N,CA,C,0,CB,CG,SD,CE 
LEU 135:N,CA,C,0,CB,CG,CD1,CD2 
SER 148:N,CA,C,0,CB,OG 
PRO 149:N,CA,CD,C,0,CB,CG 
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ARP: 


LEU 


152: 


N, 




ARP: 


ILE 


153: 


N, 




ARP: 


PRO 


154: 


N, 




ARP: 


GLY 


155: 


N, 


5 


ARP: 


PRO 


156: 


N, 




ARP: 


GLY 


157: 


N, 




ARP: 


ASN 


158: 






ARP: 


ILE 


163: 


N, 




ARP: 


LEU 


164: 


N, 


10 


ARP: 


MET 


167: 


N, 




ARP: 


GLU 


176: 


N, 




ARP: 


VAL 


177: 






ARP: 


VAL 


178: 


N, 




ARP: 


ASP 


179: 




15 


ARP: 


LEU 


180: 


N, 




ARP: 


LEU 


181: 


N, 




ARP: 


ALA 


182: 


N, 




ARP: 


ALA 


183: 


N, 




ARP: 


HIS 


184: 


N, 


20 


ARP: 


SER 


185: 






ARP: 


LEU 


186: 


N, 




ARP: 


ALA 


187: 


N, 




ARP: 


SER 


188: 


N, 




ARP; 


GLN 


189: 


N, 


25 


ARP: 


GLU 


190: 


Nr 




ARP: 


GLY 


191: 


N, 




ARP: 


:LEU 


192: 


:N, 




ARP: 


lASN 


193: 


IN, 




ARP: 


SER 


194; 


:N, 


30 


ARP: 


IPHE 


197: 


:N, 




ARP: 


tARG 


198' 


:N, 




ARP: 


:SER 


199. 


:N, 




ARP 


:PRO 


200^ 


:N, 




ARP 


:LEU 


201' 


:N, 


35 


ARP 


lASP 


202 


IN, 




ARP 


:SER 


203 


:N, 




ARP 


:THR 


204 


:N, 




ARP 


:PRO 


205 


:N, 




ARP 


:VAL 


207 


:N, 


40 


ARP 


:PHE 


208 


:N, 




ARP 


:ASP 


209 


:N, 




ARP 


:GLN 


211 


:N, 




ARP 


:PHE 


212 


:N( 




ARP 


:TYR 


213 


:N, 


45 


ARP 


:THR 


216 


:N, 




ARP 


:PHE 


230 


:N, 




ARP 


:ALA 


231 


:N, 




ARP 


:PHE 


241 


:N, 




ARP 


:MET 


243 


:N, 


50 


ARP 


:ARG 


244 


:N, 




ARP 


:SER 


245 


:N 




ARP 


:ASP 


246 


:N 




ARP 


;LEU 


249 


:N 




ARP 


:TRP 


259 


:N 


55 






CD2, 




ARP:TYR 


273 


:N 




ARP: MET 


277 


:N 



,CA,C,0,CB,CG,CD1,CD2 
,CA,C,0,CB,CG1,CG2,CD1 
,CA,CD,C,0,CB,CG 
,CA,C,0 

,CA,CD,C,0,CB,CG 
.CA,C,0 



CA 

CA, 

CA, 



CA,C,0 

,CA,C,0,CB,CG,0D1,ND2 
,CA,C,0,CB,CG1,CG2,CD1 
rCA,C,0,CB,CG,CDl,CD2 
,CA,C^O,CB,CG,SD,CE 
CA,C,0,CB,CG,CD,0E1,0E2 
CA,C,0,CB,CG1,CG2 
V,C,0,CB,CG1,CG2 
,C,0,CB,CG,0D1,0D2 
,C,0,CB,CG,CD1,CD2 
CA/C,0,CB,CG,CD1,CD2 
CA,C,0,CB 
CA,C,0,CB 

CA,C,0,CB,CG,ND1,CD2,CE1,NE2 
CA,C,0,CB,OG 
CA,C,0,CB,CG,CD1,CD2 
CA,C,0,CB 
CA,C,0,CB,OG 
CA,C,0,CB,CG,CD,0E1,NE2 
CA,C,0,CB,CG,CD,OEl,OE2 

CA;C,0 

CA,C, 



CA 
CA 
CA 
CA 
CA. 
CA, 
CA, 
CA, 



CA,C,0 
CA,C,0 



CA,C,0 

CA,C,0,CB,CG,CD1,CD2 
CA,C,0,CB,CG,0D1,NP2 
CA,C,0,CB,OG 

CA,C,0,CB,CG,CD1,CD2,CE1,CE2, 
CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
CA,C,0,CB,OG 
CA,CD,C,0,CB,CG 
CA,C,0,CB,CG,CD1,CD2 
CA,C,0,CB,CG,0D1,0D2 
CB,0G 



CZ 



,CA,C,0,CB,CG,0D1,0D2 
,CA,C,0,CB,0G 
XA,C,0, CB,0G1,CG2 
'*\,CD,C,0,CB,CG 
^,C,0,CB,CG1,CG2 
WC,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
V,C,0,CB,CG,0D1,0D2 
.C,0,CB,CG,CD,0E1,NE2 
, C , O , CB , CG , CDl , CD2 , CEl , CE2 , CZ 
,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ, 
C,0,CB.0G1.CG2 



w , w , , CG , CDl , CD2 , CEl , CE2 , CZ 
C,0,CB,CG,CD1,CD2,CE1,CE2,CZ, 
C A , C , O , CB , OG 1 , CG2 

CA , C , O , CB , CG , CDl , CD2 , CEl , CE2 , CZ 
CA,C,0,CB 

m _ rni _ rn5 . cki . ce2 . cz 



OH 



,CA,C,0,CB 

,CA,C,0,CB,CG,CD1,CD2,CE1,CE2, 
,CA,C,0,CB,CG,SD,CE 
,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 
,CA,C,0,CB,OG 
, CA , C , 0 , CB , CG , OD 1 , 0D2 
,CA,C,0,CB,CG,CD1,CD2 
,CA,C,0,CB,CG,CD1, 
, NEl , CE2 , CE3 , CZ2 , CZ3 , CH2 
rA.f!.0,CB,CG,CDl,CD2,CEl,CE2,CZ,OH 
CB,CG,SD,CE 
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ARP:MET 280:N,CA,C,O,CB,CG,SD,CE 
ARP:ALA 343:N,CA,C,0,CB 
ARPrPRO 344:N,CA,CD,C,O,0XT,CB,CG 
ARPlHEM 345H:FE,NA,NB,NC,ND,CHA,CHB, 
5 CHC,CHD,C1A,C2A,C3A,C4A,CMA,CAA,CBA,CGA 
ARPrHEM 345H:01A,02A,C1B,C2B,C3B,C4B,CMB, 

CAB,CBB,C1C,C2C,C3C,C4C,CMC,CAC,CBC 
ARPtHEM 345H:C1D,C2D,C3D,C4D,CMD,CAD,CBD,CGD,01D,02D 
ARP:CA 346H:CA 
10 ARP:CA 347H:CA 

Subset RESTx: 

restxmole. list 
Subset RESTX 

NEWMODEL: 9 ,334-336 
15 restxatoin . list 
Subset RESTX: 

NEWMODELlSER 9:N,CA,C,0,CB,0G 
NEWMODEL: SER 3 34 : N , CA , C , O , CB , OG 
NEWMODEL: GLY 335:N,CA,C,0 
20 NEWMODEL:PRO 336 :N, CA, CD, C, O, CB, CG 



Example 5 

Activation of mPEG 15,000 with N-succinimidvl carbonate 

25 mPEG 15,000 was suspended in toluene (4 ml/g of idPEG) 20% was 

distilled off at normal pressure to dry the reactants 
azeotropically. Dichloromethane (dry 1 ml/g mPEG) was added when 
the solution was cooled to 30 *C and phosgene in toluene (1.93 M 5 
mole/mole mPEG) was added and mixture stirred at room temperature 

30 over night. The mixture was evaporated to dryness and the desired 
product was obtained as waxy lumps. 

After evaporation dichloromethane and toluene (1:2, dry 3 
ml/g mPEG) was added to re-dissolve the white solid. N-Hydroxy 
succinimide (2 mole/mole mPEG.) was added as a solid and then 

35 triethylamine (1.1 mole/mole mPEG) . The mixture was stirred for 3 
hours, initially vmclear, then clear and ending with a small 
precipitate. The mixture was evaporated to dryness and 
recrystallised from ethyl acetate (10 ml) with warm filtration to 
remove salts and insoluble traces. The blank liquid was left for 

40 slow cooling at ambient temperature for 16 hours and then in the 
refrigerator over night. The white precipitate was filtered and 
washed with a little cold ethyl acetate and dried to yield 98 % 
(w/w) . NMR Indicating 80 - 90% activation and 5 o/oo (w/w) 
HNEt3Cl. ^H-NMR for mPEG 15,000 (CDCI3) d 1.42 t (1= 4.8 CH3 i 

45 HNEtaCl) , 2.84 s (1= 3.7 succinimide), 3.10 dq (1= 3.4 CH2 i 
HNEtaCl), 3.38 s (1= 2.7 CH3 i OMe) , 3.40* dd (I = 4.5 o/oo, ^^C 
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satellite), 3,64 bs (I = 1364 main peak), 3.89* dd (I = 4.8 o/oo , 
^^C satellite), 4.47 dd (I = 1*8, CH2 in PEG). No change was seen 
after storage in a desiccator at 22 ®C for 4 months. 

5 Example 6 

Activation of laPEG 5.000 with N-succinimidvl carbonate 

Activation of mPEG 5,000 with N-succinimidyl carbonate was 
performed as described in Example 5. 

10 EXAMPLE 7 

Construction and expression of PD498 variants ; 

PD498 site-directed variants were constructed using the "maxi- 

oligonucleotide-PCR" method described by Sarkar et al., (1990): 

BioTechniques 8: 404-407. 
15 The template plasmid was shuttle vector pPD498 or an analogue 

of this containing a variant of the PD498 protease gene. 

The following PD498 variants were constructed, expressed and 

purified. 

A: R28K 
20 B: R62K 

C: R169K 

D: R28K + R62K 

E: R28K + R169K 

F: R62K + R169K 
25 G: R28K+R69K+R169K 

Construction of variants 

For introduction of the R28K substitution a synthetic 

oligonucleotide having the sequence: GGG ATG TAA CCA AGG GAA GCA 
30 GCA CTC AAA CG (SEQ ID NO. 7) was used. 

A PGR fragment of 769 bp was ligated into the pPD498 plasmid 

prepared by Bst E II and Bgl II digestion. Positive variants were 

recognized by Sty I digestion and verified by DNA sequencing of the 

total 769 bp insert. 
35 For introduction of the R62K substitution a synthetic 

oligonucleotide having the sequence: 

CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) was used. 

A PCR fragment of 769 bp was ligated into the pPD498 plasmid 
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prepared by Bst E II and Bgl II digestion. Positive variants were 
recognized by Clal digestion and verified by DNA sequencing of the 
total 769 bp insert. 

For introduction of the R169K substitution a synthetic 
5 oligonucleotide having the sequence: 

CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) was used. 

A PGR fragment of 769 bp was ligated into the pPD498 plasmid 
prepared by Bst E II and Bgl II digestion. Positive variants were 
recognized by the absence of a Rsa I restriction site and verified 
10 by DNA sequencing of the total 769 bp insert. 

For simultaneously introduction of the R28K and the R62K 
substitutions, synthetic oligonucleotides having the sequence: 
GGG ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO. 7) and the 
sequence : 

15 CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) were used 
simultaneously. A PCR fragment of 769 bp was ligated into the 
pPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Styl and Clal digestion and verified 
by DNA sequencing of the total 769 bp insert. 

20 For simultaneously introduction of the R28K and the R169K 
substitutions, synthetic oligonucleotides having the sequence: GGG 
ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID NO. 8) and the 
sequence: 

CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 8) were used 
25 simultaneously. A PCR fragment of 769 bp was ligated into the 
pPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Styl digestion and absence of a Rsa I 
site. The variant was verified by DNA sequencing of the total 769 
bp insert. 

30 For simultaneously introduction of the R62K and the R169K 
substitutions, synthetic oligonucleotides having the sequence: CGA 
CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO. 8) and the sequence: 
CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) were used 
simultaneously. A PCR fragment of 769 bp was ligated into the 

35 PPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Clal digestion and absence of a Rsa I 
site. The variant was verified by DNA sequencing of the total 769 
bp insert 
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For simultaneously introduction of the R28K, the R62K and the 
R169K substitutions, synthetic oligonucleotides having the 
sequence: 

GGG ATG TAA CCA AGG GAA GCA GCA CTC AAA CG (SEQ ID No. 1) , the 
5 sequence: 

CGA CTT TAT CGA TAA GGA CAA TAA CCC (SEQ ID NO* 8) and the 
sequence: 

CAA TGT ATC CAA AAC GTT CCA ACC AGC (SEQ ID NO. 9) were used 
simultaneously. A PCR fragment of 769 bp was ligated into the 
10 pPD498 plasmid prepared by Bst E II and Bgl II digestion. Positive 
variants were recognized by Styl and Clal digestion and absence of 
a Rsa I site. The variant was verified by DNA sequencing of the 
total 769 bp insert. 

15 Fermentation, expression and purification of PD498 variants 

Vectors hosting the above mentioned PD498 variants were 
purified from E. coli cultures and transformed into S. subtllis in 
which organism the variants were fermented, expressed and purified 
as described in the "Materials and Methods" section above. 

20 

Example 7 

Conjugation of triple substitited PD498 variant with activated 
mPEG 5,000 

200 mg of triple sxibstituted PD498 variant (i.e. the 
25 R28K+R62K+R169K substituted variant) was incubated in 50 ram 
NaBorate, pH 10, with 1.8 g of activated mPEG 5,000 with N- 
succinimidyl carbonate (prepared according to Example 2) , in a 
final volxime of 20 ml. The reaction was carried out at ambient 
temperature using magnetic stirring. Reaction time was 1 hour. The 
30 reaction was stopped by adding DMG buffer to a final concentration 
of 5 mM dimethyl glutarate, 1 mM CaCl2 and 50 mM borate, pH 5.0. 

The molecule weight of the obtained derivative was approxi- 
mately 120 kDa/ corresponding to about 16 moles of mPEG attached 
per mole enzyme. 

35 Compared to the parent enzyme, residual activity was close to 
100% towards peptide substrate (succinyl-Ala-Ala-Pro-Phe-p- 
Nitroanilide) . 
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Example 8 

Alleraenicitv trails of PD498 variant-SPEGS . OOP in cruinea pigs 

Dunkin Hartley guinea pigs are stimulated with 1.0 ^ig PD498- 
SPEC 5,000 and 1-0 ^lg modified variant PD498-SPEG 5,000 by 
5 intratracheal installation. 

Sera from immunized Dunkin Hartley guinea pigs are tested 
during the trail period in a specific IgGi ELISA (described above) 
to elucidate whether the molecules could activate the immune 
response system giving rise to a specific IgGi response indicating 
10 an allergenic response. 

The IgGi levels of Dunkin Hartley guinea pigs dtiring the trail 
period of 10 weeks are observed. 

Example 9 

15 Suitable substitutions in Humicola lanuginosa lipase for 

addition of amino attachment groups f-NHo ^ 

The 3D structure of Humicola lanuginosa lipase (SEQ ID NO 6) 

is available in Brookhaven Databank as Itib^pdb. The lipase 

consists of 269 amino acids. 
20 The procedure described in Example 1 was followed. The 

sequence of H. lanuginosa lipase is shown below in the table 

listing solvent accessibility data for H. lanuginosa lipase. 

H. lanuginosa residue numbering is used (1-269) , and the active 

site residues (functional site) are S146, S201 and H258. The 
25 synonym TIB is used for H. lanuginosa lipase. 

The commands performed in Insight (BIOSYM) are shown in the 

command files makeKzone.bcl and makeK2one2 .bcl below: 

Conservative substitutions: 

30 makeKzone.bcl 

1 Delete Subset * 

2 Color Molecule Atoms * Specified Specification 255,0,255 

3 Zone Subset LYS :lys:NZ Static monomer /residue 10 
Color_Subset 255,255,0 

35 4 Zone Subset NTERM :1:N Static monomer /residue 10 
Color_Subset 255,255,0 

5 #NOTE: editnextline ACTSITE residues according to the 
protein 

6 Zone Subset ACTSITE : 146, 201, 258 Static monomer/residue 8 
40 Color_Subset 255,255,0 

7 Combine Subset ALLZONE Union LYS NTERM 

8 Combine Subset ALLZONE Union ALLZONE ACTSITE 

9 #NOTE: editnextline object ncune according to the protein 
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10 Combine Subset REST Difference TIB ALLZONE 

11 List Subset REST Atom Output_File restatom. list 

12 List Subset REST monomer/residue Output_File restmole. list 

13 Color Molecule Atoms ACTSITE Specified Specification 255,0,0 
5 14 List Subset ACTSITE Atom Output File actsiteatom. list 

15 List Subset ACTSITE monomer/residue Output^File 
actsitemole • list 

16 # 

17 Zone Subset REST5A REST Static Monomer/Residue 5 - 
10 Color_Subset 

18 Combine Subset SUB5A Difference REST5A ACTSITE 

19 Combine Subset SUB5B Difference SUB5A REST 

20 Color Molecule Atoms SUB5B Specified Specification 
255,255,255 

15 21 List Subset SUB5B Atom Output File sub5batom. list 

22 List Subset SUB5B monomer/ residue Output_File sub5bmole. list 

23 #Now identify sites for lys->arg substitutions . and continue 
with makezone2 .bcl 

24 #Use grep command to identify ARG in restatom. list , 
20 sub5batom, list & accsiteatom. list 

Comments : 

In this case of H. lanuginosa (=TIB) , REST contains the 
Arginines Argl3 3, Argl39, Argl60, Argl79 and Arg 209, and SUB5B 
25 contains ArgllS and R125. 

These residues are all solvent exposed. The substitutions 
R133K, R139K, R160K, R179K, R209K, R118K and R125K are 
identified in TIB as sites for mutagenesis within the scope of 
this invention. The residues are substituted below in section 
30 2, and further analysis done. The subset ACTSITE contains no 
lysines. 



Non-conservative substitutions: 
makeRzone2 • bcl 

35 1 #sourcefile malcezone2 .bcl Claus von der Osten 961128 

2 # 

3 #having scanned lists (grep arg command) and identified 
sites for lys->arg substitutions 

4 #NOTE: editnextline object name according to protein 
40 5 Copy Object -To__Clipboard -Displace TIB newmodel 

6 Biopolymer 

7 #NOTE: editnextline object name according to protein 

8 Blank Object On TIB 

9 #NOTE: editnext lines with lys->arg positions 
45 10 Replace Residue newmodel: 118 lys L 

11 Replace Residue newmodel: 125 lys L 

12 Replace Residue newmodel: 133 lys L 

13 Replace Residue newmodel: 139 lys L 

14 Replace Residue newmodel: 160 lys L 
50 15 Replace Residue newmodel: 179 lys L 

16 Replace Residue newmodel: 209 lys L 
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17 # . , 

18 #Now repeat analysis done prior to arg->lys, now including 
introduced lysines 

19 Color Molecule Atoms newmodel Specified Specification 
5 255 0 255 

20 'zone Subset LYSx newmodel: lys:NZ Static monomer/residue 10 
Color^Subset 255,255,0 

21 Zone Subset NTERMx newmodel: 1:N Static monomer /residue 10 
Color_Subset 255,255,0 

10 22 #NOTE: editnextline ACTSITEx residues according to the 
protein 

23 Zone Subset ACTSITEx newmodel: 146, 201, 258 Static 
monomer /residue 8 Color_Subset 255,255,0 

24 Combine Subset ALLZONEx Union LYSx NTERMX 

15 25 combine Subset ALLZONEx Union ALLZONEx ACTSITEx 

26 Combine Subset RESTx Difference newmodel ALLZONEx 

27 List Subset RESTx Atom Output_File restxatom.list 

28 List Subset RESTx monomer /residue Output_File 
restxmole. list 

20 29 # 

30 Color Molecule Atoms ACTSITEx, Specified Specification 
255,0,0 

31 List Subset ACTSITEx Atom Output^File acts itexatom. list 

32 List Subset ACTSITEx monomer/ residue Output_File 
25 actsitexmole.list 

33 # 

34 #read restxatom.list or restxmole. list to identify sites 
for (not_arg) ->lys subst. if needed 

30 Comments: 

Of the residues in RESTx, the following are >5% exposed (see 
lists below): 18,31-33,36,38,40,48,50,56-62,64,78,88,91-93,104- 
106,120,136,225,227-229,250,262,268. Of these three are 
Cysteines involved in disulfide bridge formation, and 
35 consequently for structural reasons excluded from the residues 
to be mutated. The following mutations are proposed in H. 
lanugrinosa lipase (TIB) : 

A18K,G31K,T32K,N33K,G38K,A40K,D48K,T50K,E56K,D57K,S58K,G59K, 
V60K,G61K,D62K,T64K,L78K,N88K,G91K,N92K,L93K,S105K,G106K, 
40 V120K,P136K,G225K,L227K,V228K,P229K,P250K,F262K. 
Relevant data for Example 2: 

# TIBN0H20 

# residue area 
GLU_1 110.792610 

45 VAL_2 18.002457 

SER_3 53.019516 

GLN_4 85,770164 

ASP 5 107.565826 

tEU^6 33.022659 

50 PHE_7 34.392754 

ASN 8 84.855331 
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GLN 


9 


39.175591 




phe" 


10 


2.149547 




ASN 


11 


40.544380 




leu" 


12 


27.648788 


5 


PHE 


13 


2.418241 




ALA"" 


14 


4.625293 




GLN 


15 


28.202387 




tyr" 


16 


0.969180 




ser" 


17 


0.000000 


10 


ALA 


18 


7.008336 




ALA" 


19 


0.000000 




ALA 


20 


0.000000 




TYR 


"21 


6.947358 




CYS" 


22 


8.060802 


15 


gly" 


23 


32.147034 




LYS" 


"24 


168.890747 




ASN 


"25 


8.014721 




ASN 


26 


11.815564 




ASP 


27 


92.263428 


20 


ALA 


28 


18.206699 




pro" 


29 


83.188431 




ALA" 


"30 


69.428421 




gly" 


"31 


50.693439 




thr" 


32 


52.171135 


25 


ASN" 


"33 


111.230743 




ile" 


"34 


2.801945 




thr" 


"35 


82.130569 




CYS" 


'36 


17.269245 




thr" 


■37 


96.731941 


30 


GLY 


"38 


77.870995 




ASN* 


'39 


123.051003 




ALA" 


"40 


27.985256 




CYS 


"41 


0.752820 




pro" 


■42 


46.258949 


35 


GLU" 


'43 


69.773987 




val" 


■44 


0.735684 




GLU" 


'45 


77.169510 




LYS" 


'46 


141.213562 




ALA 


"47 


10.249716 


40 


asp' 


"48 


109.913902 




ALA" 


"49 


2.602721 




thr" 


"50 


32.012184 




phe" 


"51 


8.255627 




leu" 


""52 


60.093613 


45 


tyr" 


"53 


77.877937 




ser" 


"54 


26.980494 




phe" 


"55 


10.747735 




GLU 


"56 


112.689758 




asp" 


"57 


92.064278 


50 


ser 


58 


32 . 990780 




gly" 


■59 


53.371807 




val" 


"60 


83.563644 




gly" 


"61 


69.625633 




asp" 


"62 


75.520988 


55 


val" 


"63 


4.030401 




thr" 


"64 


8.652839 




gly" 


"65 


0.000000 
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PHE 


66 


0.268693 




LEU 


67 


11.822510 




ALA 


68 


0.537387 




LEU 


69 


30.243870 


5 


ASP 


70 


0.000000 




ASN 


"71 


84.101044 




THR 


72 


89.271126 




ASN 


"73 


70.742401 




LYS~ 


"74 


98.319168 


10 


LEU" 


"75 


8.329495 




ILE 


76 


5.197878 




VAL 


"77 


0.806080 




LEU" 


'78 


5.293978 




SER 


'79 


0.000000 


15 


PHE~ 


'80 


2.079151 




ARG 


"81 


41.085312 




gly" 


'82 


1.471369 




ser" 


83 


43.794014 




arg" 


'84 


100.261627 


20 


SER 


'85 


70.607552 




ILE~ 


'86 


59.696865 




GLU" 


"87 


136.510773 




ASN" 


"88 


119.376373 




TRP 


'89 


102.851227 


25 


ILE' 


"90 


78.068588 




GLY" 


"91 


60.783607 




ASN" 


"92 


45.769428 




leu" 


"93 


134.228363 




asn" 


"94 


101.810959 


30 


PHE 


"95 


41.212212 




asp" 


"96 


79.645950 




leu" 


'97 


25.281572 




LYS" 


"98 


88.840263 




GLU" 


■99 


132.377090 


35 


ILE" 


"100 


9.135575 




asn" 


"lOl 


63.444527 




asp" 


"102 


88.652847 




ile" 


"103 


33.470661 




CYS" 


"104 


11.553816 


40 


ser' 


"105 


99.461174 




gly' 


~106 


40.325161 




CYS' 


107 


4.433561 




arg" 


"IO8 


97.450104 




gly' 


"109 


1.343467 


45 


his' 


"110 


4.652464 




asp' 


"ill 


37.023655 




gly' 


~112 


29.930408 




phe' 


"113 


14.976435 




thr] 


_114 


10.430954 


50 


ser" 


~115 


40.606895 




ser" 


"116 


13.462922 




TRP" 


"117 


10.747735 




arg" 


"118 


114.364281 




ser" 


"119 


46.880249 


55 


VAL 


120 


13.434669 




ALA' 


"121 


18.258261 




ASP' 


"122 


110.753098 
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THR_123 
LEU_124 
ARG_125 
GLN_126 
5 LYS_127 
VAL_128 
GLU_129 
ASP_130 
ALA_131 

10 VAL_132 
ARG_133 
GLU_134 
HIS_13 5 
PRO_136 

15 ASP_137 
TYR_138 
ARG_139 
VAL_140 
VAL_141 

20 PHE_142 
THR_143 
GLY_144 
HIS_145 
SER_146 

25 LEU_147 
GLY_148 
GLY_149 
ALA_150 
LEU_151 

30 ALA_152 
THR_153 
VAL_154 
ALA_155 
GLY_156 

35 ALA_157 
ASP_158 
LEU_159 
ARG_160 
GLY_161 

40 ASN_162 
GLY_163 
TYR_164 
ASP_165 
ILE_166 

45 ASP_167 
VAL_168 
PHE_169 
SER_170 
TYR_171 

50 GLY_172 
ALA_173 
PRO_174 
ARG_175 
VAL_176 

55 GLY_177 
ASN_178 
ARG 179 



69.641922 

17.090784 

73.929977 

101.320190 

84.450241 

6.448641 

47.700993 

75.529091 

11.340775 

27.896025 

153.136490 

132.140594 

54.553406 

97.386963 

22.653191 

35.392658 

74.321243 

10.173222 

0.233495 

3.224321 

0. 000000 

0.000000 

4.514527 

15.749787 

40.709171 

0.000000 

0.000000 

0.537387 

22.838938 

0.268693 

18.078798 

7.254722 

0. 000000 

0.000000 

15.140230 

41.645477 

6.144750 

41.939716 

68.978180 

68.243805 

79.181274 

36.190247 

103.068283 

0.000000 

24.326443 

4.299094 

0.466991 

3.339332 

0.000000 

0.000000 

12.674671 

13.117888 

10. 004488 

21.422220 

2.680759 

21.018063 

110.282166 
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ALA 180 


33.210381 




PHE 181 


4.567788 




ALA 


182 


3 .897251 




GLU 


183 


76.354004 


5 


PHE~ 


184 


71.225983 




LEU 


185 


24.985012 




THR 


186 


47.023815 




VAL 


187 


98.244606 




GLN 188 


54.152954 


10 


THR 189 


88.660645 




GLY 


190 


24.792120 




gly" 


191 


10.726818 




thr" 


'192 


45.458744 




leu" 


"193 


16.633211 


15 


tyr" 


"194 


34.829491 




ARG 


"195 


29.030851 




ILE 


'196 


1.973557 




thr" 


"197 


3.493014 




his" 


"198 


1.532270 


20 


thr" 


'199 


34.785877 




ASN 


"200 


39.789238 




ASP 201 


0.000000 




ILE 202 


31.168434 




VAL 


203 


29.521076 


25 


pro" 


'204 


3.515322 




arg" 


[205 


44.882454 




LEU 


20o 


51.051746 




PRO^ 


"207 


12.575329 




pro" 


'208 


43.259636 


30 


arg" 


"209 


113.700233 




GLU" 


"210 


154.628540 




PHE" 


'211 


112.505188 




gly" 


'212 


30.084938 




tyr" 


'213 


3 .268936 


35 


ser" 


'214 


12.471436 




his" 


"215 


23.354481 




ser" 


'216 


16-406200 




ser" 


'217 


14.665598 




PRO" 


"218 


17.240993 


40 


GLU" 


'219 


13.145291 




tyr' 


"22 0 


18.718306 




TRP" 


'221 


39.229233 




ILE" 


"222 


5.105175 




lys" 


"223 


120.739983 


45 


SER* 


"224 


15.407301 




gly' 


'225 


29.306646 




thr" 


"226 


66.806862 




LEU" 


'227 


122.682808 




VAL" 


"228 


60.923004 


50 


pro" 


"229 


104 . 620377 




VAL" 


230 


23.398251 




thr' 


"231 


63.372971 




arg' 


"232 


80.357857 




asn" 


"233 


89.255066 


55 


asp" 


"234 


43.011250 




ILE' 


"235 


2.114349 




VAL" 


"236 


45.140491 
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LYS 


237 


105.651306 




ile" 


238 


24.671705 




GLU" 


"239 


116.891907 




GLY 240 


31.965794 


5 


ILE""241 


46.278099 




ASP 


242 


28.963699 




ALA" 


"243 


25.158146 




thr" 


"244 


98.351440 




gly" 


"245 


43.842186 


10 


gly" 


"246 


0.700486 




ASN 


"247 


3.926274 




asn" 


'248 


51.047890 




gln" 


"249 


66.699188 




pro" 


"250 


132.414047 


15 


asn" 


"251 


70.213730 




ILK 


"252 


141.498062 




pro" 


"253 


59.089233 




asp" 


"254 


59.010895 




ile" 


"255 


63.298943 


20 


PRO 


"256 


78.608688 




ALA" 


"257 


0.806080 




his" 


"258 


3.761708 




leu" 


"259 


50.747856 




TRP" 


"260 


35.229710 


25 


tyr" 


"261 


5.440791 




phe" 


"262 


36.457939 




gly" 


"263 


22.071375 




leu" 


'2 64 


109.148178 




ile" 


"265 


2.418241 


30 


gly" 


'266 


17.730062 




thr" 


"267 


68.217873 




CYS" 


'268 


15.418195 




leu" 


"2 69 


165.990997 



Subset REST: 
35 restmole. list 
Subset REST: 

TIB:5,8-9, 13-14, 16, 18-20, 31-34, 36, 38, 40, 48-50, 56- 
66,68,76-79,88,91-93, 

TIB: 100-107 , 116-117 , 119-121 , 132-134 , 136 , 139-142 , 154- 
40 169,177-185, 

TIB : 187, 189-191, 207-2 12, 214-216, 225, 227-229, 241- 

244,250,262,268 
restatom. list 
Subset REST: 
45 TIB:ASP 5 : N, CA, C, O, CB, CG , ODl , 0D2 

TIB:ASN 8:N,CA,C,0,CB,CG,0D1,ND2 

TIB: GLN 9 : N, CA, C, O , CB, CG, CD, OEl , NE2 

TIB : PHE 13 : N , CA , C , O , CB , CG , GDI , CD2 , CEl , CE2 , CZ 

TIB:ALA 14 :N,CA,C,0,CB 
50 TIB : TYR 16 : N, CA, C, O, CB, CG, GDI , CD2 , CEl, CE2 , CZ , OH 

TIB:AIiA 18:N,CA,C,0,CB 

TIB:ALA 19:N,CA,C,0,CB 

TIBiALA 20:N,CA,C,0,CB 

TIB:GLY 31:N,CA,C,0 
55 TIB:THR 32:N,CA,C,0,CB,0G1,CG2 

TIB: ASN 33:N,CA,C,0,CB,CG,0D1,ND2 

TIB:ILE 34:N,CA,C,0,CB,CG1,CG2,CDI 
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TIB: 


CYS 


36: 


N, 


CA, 


c, 


0, 


CB,SG 




TIB: 


GLY 


38: 


N, 


CA, 


c, 


0 






TIB: 


ALA 


40: 


N, 


CA, 


c, 


0, 


CB 




TIB: 


ASP 


48: 


N, 


CA, 


c, 


0, 


CB,CG,0D1,0D2 


5 


TIB: 


ALA 


49: 


N, 


CA, 


c, 


0, 


CB 




TIB: 


THR 


50: 


N, 


CA, 


c, 


0, 


CB,0G1,CG2 




TIB: 


GLU 


56: 


N, 


CA, 


c, 


0, 


CB,CG,CD,0E1,0E2 




TIB: 


ASP 


57: 


;N, 


CA, 


c, 


0, 


CB,CG, 001^002 




TIB: 


SER 


58: 


;N, 


CA, 


c, 


Or 


CB,OG 


10 


TIB: 


.GLY 


59: 


:N, 


CA, 


c, 


0 






TIB: 


VAL 


60: 


:N, 


CA, 


c, 


0, 


CB,CG1,CG2 




TIB: 


:GLY 


61: 


:N, 


CA, 


c, 


0 






TIB: 


lASP 


62: 


:N, 


CA, 


c, 




CB,CG,0D1,0D2 




TIB: 


^VAL 


63' 


:N, 


CA, 


c, 


0, 


CB,CG1,CG2 


15 


TIB: 


:THR 


64 


:N, 


CA, 


c, 


0, 


CB,0G1,CG2 




TIB: 


:GLY 


65 


:N, 


rCA, 


c, 


0 






TIB: 


:PHE 


66' 


:N, 


CA, 






CB , CG , CDl , CD2 , CEl , CE2 , CZ 




TIB: 


^ALA 


68 


:N, 


rCA, 






CB 




TIB: 


:ILE 


76 


:N, 


rCA, 




fO, 


CB,CG1,CG2,CD1 


20 


TIB' 


:VAL 


77 


:N, 


rCA, 


rC, 


0, 


CB,CG1,CG2 




TIB. 


:LEU 


78 


:N, 


rCA, 


rC, 


0, 


CB,CG,CD1,CD2 




TIB, 


:SER 


79 


;N, 


rCA, 




fO, 


CB,OG 




TIB 


:ASN 


88 


:N, 


rCA, 


rC, 


fO, 


CB,CG,0D1,ND2 




TIB 


:GLY 


91 


:N 


rCA, 


rC, 


fO 




25 


TIB 


:ASN 


92 


:N, 


rCA, 




fO, 


CB,CG,0D1,ND2 




TIB' 


:LEU 


93 


:N, 


rCA, 


fC, 




CB,CG,CD1,CD2 





TIB 


:ILE 


100: 


N, 


CA, 


c, 


o< 


,CB,CG1,CG2,CD1 






TIB 


:ASN 


101: 


N, 


CA, 


c, 


0, 


,CB,CG,0D1,ND2 






TIB 


:ASP 


102: 


N, 


CA, 


c, 


0, 


,CB,CG,ODl,OD2 




30 


TIB 


:ILE 


103:N, 


CA, 


c, 


Oi 


,CBrCGl,CG2,CDl 






TIB 


:CYi5 


104:N, 


CA, 


c, 


0 


,CB,SG 






TIB 


:SER 


105: 


N, 


CA, 


c, 


0, 


,CB,OG 






TIB 


:GLY 


106: 


N, 


CA, 


c, 


0 








TIB 


:CYS 


107: 


N, 


CA, 


c, 


0 


, CB , SG 




35 


TIB 


rSER 


116: 


N, 


CA, 


c, 


0 


,CB,OG 






TIB 


:TRP 


117: 


N, 


CA, 


c, 


0 


,CB,CG,CD1,CD2,NE1, 


CE2, 




CE3,CZ2 


,CZ3 


,CH2 












TIB: SER 


119: 


N, 


CA, 


c, 


0 


,CB,OG 






TIB: VAL 


120: 


N, 


CA, 




0 


,CB,CG1,CG2 




40 


TIB 


:ALA 


121: 


N, 


CA, 




0 


,CB 






TIB 


:VAL 


132: 


N, 


CA, 




rO 


,CB,CG1,CG2 






TIB 


:ARG 


133: 


N, 


CA, 




rO 


,CB,CG,CD,NE,CZ,NH1 


,NH2 




TIB: GLU 


134: 


N, 


CA, 




rO 


,CB,CG,CD,0E1,0E2 






TIB: PRO 


136: 


N, 


CA, 


CD,C,0,CB,CG 




45 


TIB 


:ARG 


139: 


N, 


.CA, 




fO 


,CB,CG,CD,NE,CZ,NH1 


,NH2 




TIB 


:VAL 


140: 


:N, 


rCA, 




fO 


,CB,CG1,CG2 






TIB: VAL 


141: 


:N, 


rCA, 




fO 


,CB,CG1,CG2 






TIB 


:PHE 


142: 




rCA, 


C 


fO 


,CB,CG,CDl,CD2rCEl, 


CE2,CZ 




TIB 


:VAL 


154: 




rCA, 


.C 


rO 


,CB,CG1,CG2 




50 


TIB 


:ALA 


155: 




rCA, 


rC 


fO 


,CB 






TIB: GLY 


156: 




rCA, 




fO 








TIB: ALA 


157: 


IN, 


rCA, 


pC 


fO 


,CB 






TIB 


:ASP 


158' 


:N 


pCA, 


fC 


fO 


,CB,CG,0D1,0D2 






TIB: LEU 


159, 


:N 


rCA, 


fC 


fO 


,CB,CG,CD1,CD2 




55 


TIB:ARG 


160 


IN 


rCA, 


pC 


rO 


,CB,CG,CD,NE,CZ,NH1 


,NH2 




TIB 


:GLY 


161 


:N 


,CA 


fC 


fO 








TIB:ASN 


162 


:N 


,CA 


fC 


fO 


,CB,CG,0D1,ND2 
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TIB: 


GLY 


163: 


N, 


CA, 


C,0 






TIB: 


TYR 


164: 


N, 


CA, 


C,0,CB,CG,CD1,CD2,CE1, 


CE2 , CZ , OH 




TIB: 


ASP 


165: 




CA, 


C,0,CB,CG,0D1,0D2 






TIB: 


ILE 


166: 


N, 


CA, 


C,0,CB,CG1,CG2,CD1 




5 


TIB: 


ASP 


167: 


N, 


CA, 


C,0,CB,CG,0D1,0D2 






TIB: 


VAL 


168: 


N, 


CA, 


C,0,CB,CG1,CG2 






TIB: 


PHE 


169: 


N, 


CA, 


C,0,CB,CG,CD1,CD2,CE1, 


CE2,CZ 




TIB! 


GLY 


177: 


N, 


CA, 


C,0 






TIB: 


ASN 


178: 


N, 


CA, 


C,0,CB,CG,0D1,ND2 




10 


TIB: 


ARG 


179: 


N, 


CA, 


C,0,CB,CG,CD,NE,CZ,NH1 


,NH2 




TIB: 


ALA 


180: 


N, 


CA, 


C,0,CB 






TIB: 


PHE 


181: 


N, 


CA, 


C,0,CB,CG,CD1,CD2,CE1, 


CE2 , CZ 




TIB: 


ALA 


182: 


N| 


CA, 


C,0,CB 






TIB: 


GLU 


183: 




CA, 


C,0,CB,CG,CD,0E1,0E2 




15 


TIB: 


PHE 


184: 




CA, 


C , 0 , CB , CG , CD 1 , CD2 , CEl , 


CE2 , CZ 




TIB: 


LEU 


185: 




CA, 


C,0,CB,CG,CD1,CD2 






TIB: 


;VAL 


187: 


N, 


CA, 


C,0,CB,CG1,CG2 






TIB: 


:THR 


189: 




CA, 


C,0,CB,0G1,CG2 






TIB: 


GLY 


190: 


N, 


CA, 


c,o 




20 


TIB: 


.GLY 


191: 


►N, 


CA, 


c,o 






TIB: 


;PRO 


207: 


:N, 


CA, 


CD,C,0,CB,CG 






TIB: 


:PRO 


208: 


:N, 


CA, 


CD,C,0,CB,CG 






TIB: 


:ARG 


209: 


:N, 


CA, 


C,0,CB,CG,CD,NE,CZ,NH1 


,NH2 




TIB: 


:GLU 


210: 


:N, 


CA, 


C,0,CB,CG^CD,0E1,0E2 




25 


TIB: 


:PHE 


211: 


:N, 


CA, 


, C , 0 , CB , CG , CDl , CD2 , CEl , 


CE2 , CZ 




TIB; 


:GLY 


212: 




rCA, 


rC,0 






TIB J 


:SER 


214' 




rCA, 


C,0,CB,OG 






TIB' 


:HIS 


215 


'N, 


rCA, 


C,0,CB,CG,ND1,CD2,CE1, 


NE2 




TIB 


:SER 


216' 


:N, 


.CA, 


C,0,CB^OG 




30 


TIB 


:GLY 


225 


:N, 


rCA, 


rC,0 






TIB 


:LEU 


227 


:N, 


rCA, 


,C,0,CB,CG,CD1,CD2 






TIB 


:VAL 


228 


:N, 


rCA, 


,C,0,CB,CG1,CG2 






TIB 


:PRO 


229 


:N, 


rCA, 


,CD,C,0,CB,CG 






TIB 


:ILE 


241 


:N, 


rCA, 


,C,0,CB,CG1,CG2,CD1 




35 


TIB 


:ASP 


242 


:N, 


rCA, 


,C,0,CB,CG,0D1,0D2 






TIB 


:ALA 


243 


:N 


rCA, 


,C,0,CB 






TIB 


:THR 


244 


:N, 


,CA 


,C,0,CB,0G1,CG2 






TIB 


:PRO 


250 


:N, 


fCA, 


,CD,C,0,CB,CG 






TIB 


:PHE 


262 


:N 




,C,0,CB,CG,CD1,CD2,CE1, 


CE2,CZ 


40 


TIB 


:CYS 


268 




,CA, 


^ C , 0 , CB , S6 





Subset SUB5B: 
sub5mole. list 



Subset SUB5B: 

TIB: 3-4, 6-7, 10-12, 15, 22-23, 25-30, 35, 37, 39, 41-42, 44-47, 51- 
45 55,67,69-70, 

TIB: 72 , 74-75 , 94-99 , 108-112 , 114-115, 118 , 122-126 , 128- 
131,135,137-138, 

TIB: 186,188, 192-195,213,217-219,223-224,230-231,234-235, 238- 
240, 

50 TIB:245,269 

sub5batom. list 
Subset SUB5B: 

TIB:SER 3:N,CA,C,0,CB,0G 
TIB:GLN 4 :N, CA, C, O, CB, CG, CD, 0E1,NE2 
55 TIBiLEU 6:N,CA,C,0,CB,CG,CD1,CD2 

TIB: PHE 7:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 
TIB:PHE 10:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 



- wo 98^5026 



PCT/DK98/00046 



99 





TIB: 


ASN 


11: 


N,CA,C,0,CB,CG,0D1,ND2 






TIB: 


;LEU 


12: 


N,CA,C,0,CB,CG,CD1,CD2 






TIB: 


;GLN 


15: 


N,CA,C,0,CB,CG,CD,0E1,NE2 






TIB: 


;CYS 


22: 


.N,CA,C,0,CB,SG 




5 


TIB: 


.GLY 


23: 


:N,CA,C,0 






TIB: 


;ASN 


25: 


N,CA,C,0,CB,CG,0D1,ND2 






TIB: 


:ASN 


26: 


:N,CA,C,0,CB,CG,0D1,ND2 






TIB: 


:ASP 


27: 


N,CA,C,0,CB,CG,0D1^0D2 






TIB: 


ALA 


28: 


.N,CA,C,0,CB 




10 


TIB: 


:PRO 


29: 


|N,CA,CD,C,0,CB,CG 






TIB: 


.ALA 


30: 


N,CA,C,0,CB 






TIB: 


THR 


35: 


N,CA,C,0,CB,0G1,CG2 






TIB: 


.THR 


37: 


N,CA,C,0,CB,0G1,CG2 






TIB: 


;ASN 


39: 


:N,CA,C,0,CB,CG,0D1,ND2 




15 


TIB: 


:CYS 


41: 


'N,CA,C,0,CB,SG 






TIB: 


:PRO 


42: 


N,CA,CD,C,0,CB,CG 






TIB: 


:VAL 


44: 


.N,CA,C,0,CB,CG1,CG2 






TIB: 


:GLU 


45: 


;N,CA,C,0,CB,CG,CD,0E1,0E2 






TIB; 


:LYS 


46: 


N,CA,C,0,CB,CG,CD,CE,NZ 




20 


TIB: 


:ALA 


47: 


N,CA,C,0,CB 






TIB: 


;PHE 


51: 


N , CA , C , 0 , CB , CG , GDI , CD2 , CEl , CE2 , 


CZ 




TIB: 


;LEU 


52: 


.N,CA,C,0,CB,CG,CD1,CD2 






TIB: 


ITYR 


53: 


:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2, 


CZ,OH 




TIB: 


:SER 


54: 


N,CA,C,O,CB,0G 




25 


TIB: 


;PHE 


55: 


N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2, 


CZ 




TIB: 


:LEU 


67: 


N,CA,C,0,CB,CG,CD1,CD2 






TIB: 


:LEU 


69: 


;N,CA,C,0,CB,CG,CD1,CD2 






TIB: 


:ASP 


70: 


:N,CA,C,0> CB,CG,0D1,0D2 






TIB: 


:THR 


72: 


;N,CA,C,O,CB,0Gl,CG2 




30 


TIB: 


:LYS 


74: 


:N,CA,C,0,CB,CG,CD,CE,NZ 






TIB: 


;LEU 


75: 


;N,CA,C,0,CB,CG,CD1,CD2 






TIB: 


:ASN 


94 


:N,CA,C,O,CB,CG,0Dl,ND2 






TIB: 


:PHE 


95 


; N , OA , C , O , CB , CG , GDI , CD2 , CEl , CE2 , 


CZ 




TIB^ 


;ASP 


96: 


:N, CA,C,0,CB, CG, GDI, 0D2 




35 


TIB 


:LEU 


97; 


:N,CA,C,0,CB,CG,CD1,CD2 






TIB. 


:LYS 


98^ 


:N,CA,C,0,CB,CG,CD,CE,NZ 






TIB, 


:GLU 


99 


:N,CA,C,0,CB,CG,CD,0E1,0E2 






TIB 


:ARG 


108:N,CA,C,O,CB,CG,CD,NE,CZ,NHl,NH2 




TIB 


:GLY 


109:N,CA,C,0 




40 


TIB 


:HIS 


110:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 






TIB 


:ASR 


111:N,CA,C,0,CB,CG,OD1,OD2 






TIB 


:GLY 


112:N,CA,c;o 






TIB 


:THR 


114:N,CA,C,0,CB,0G1,CG2 






TIB 


:SER 


115:N,CA,C,0,CB,0G 




45 


TIB 


:ARG 


118:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 




TIB 


:ASP 


122:N,CA,C,0,CB,CG,ODl,OD2 






TIB 


:THR 


123:N,CA,C,0,CB,0G1,CG2 






TIB 


:LEU 


124:N,CA,C,0,CB,CG,CD1,CD2 






TIB 


:ARG 


125:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 


50 


TIB 


:GLN 


126:N,CA,C,0,CB,CG,CD,0E1,NE2 






TIB 


:VAL 


128:N,CA,C,0,CB,CG1,CG2 






TIB 


:GLU 


129:N,CA,C,0,CB,CG,CD,0E1, 0E2 






TIB 


:ASP 


130:N,CA,C,0,CB,CG,OD1,OD2 






TIB 


:ALA 


131:N,CA,C,0,CB 




55 


TIB 


:HIS 


135:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 






TIB 


:ASP 


137:N,CA,C,0,CB,CG,ODl,OD2 






TIB 


:TYR 


138:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2 


,CZ,OH 
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TIB: 


:THR 


186 


:N,CA,C,0, 


CB,0G1,CG2 






TIBj 


:GLN 


188 


:N,CA,C,0, 


CB,CG,CD,0E1,NE2 






TIB: 


:THR 


192 


:N,CA,C,0, 


CB , OGl , CG2 






TIB: 


:LEU 


193 


iN,CA,C,0, 


CB,CG,CD1,CD2 




5 


TIB: 


:TYR 


194 


lN,CA,C,0, 


CB,CG,CD1,CD2,CE1,CE2,CZ 


,0H 




TIB: 


:ARG 


195 


:N,CA,C,0, 


CB,CG,CD,NE,CZ,NH1,NH2 






TIB: 


:TYR 


213 


:N,CA,C,0, 


CB , CG , GDI , CD2 , CEl , CE2 , CZ 






TIB: 


:SER 


217 


:N,CA,C,0, 


CB,OG 






TIB: 


:PRO 


218. 


SN,CA,CD,C,0,CB,CG 




10 


TIB: 


:GLU 


219" 


:N,CA,C,0, 


CB,CG,CD,0E1,0E2 






TIB: 


:LYS 


223 


:N,CA,C,0, 


CB,CG,CD,CE,NZ 






TIB: 


\SER 


224 


;N,CA,C,0, 


CB,OG 






TIB: 


:VAL 


230' 


:N,CA,C,0, 


CB,CG1,CG2 






TIB: 


:THR 


231 


:N, CA,C,0, 


CB,0G1,CG2 




15 


TIB: 


:ASP 


234 


:N,CA,C,0, 


CB,CG,0D1,0D2 






TIB: 


:ILE 


235 


:N,CA,C,0, 


CB,CG1,CG2,CD1 






TIB: 


:ILE 


238 


:N,CA,C,0, 


CB,CG1,CG2,GD1 






TIB: 


:GLU 


239 


:N,CA,C,0, 


CB,CG,CD,0E1,0E2 






TIB 


:GLY 


240 


:N,CA,C,0 






20 


TIB: 


:GLY 


245 


:N,CA,C,0 








TIB: 


:LEU 


269 


:N,CA,C,0, 


CB,0XT,CG,CD1,CD2 





Subset ACTSITE: 

actsiteinole. list 



Subset ACTSITE: 

25 TIB: 17, 21, 80-87, 89-90, 113, 143-153, 170-176, 196-206, 221- 

222,226,246-249, 
TIB: 251-261, 263-267 
actsiteatom. list 
Subset ACTSITE: 
30 TIBrSER 17:N,CA,C,0,CB,0G 

TIBrTYR 21:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 

TIB : PHE 8 0 : N , CA , C , O , CB , CG , CDl , CD2 , CEl , CE2 , CZ 

TIB:ARG 81:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 

TIB:GLY 82:N,CA,C,0 
35 TIB:SER 83:N,CA,C,0,CB,OG 

TIB:ARG 84:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 

TIB:SER 85:N,CA,C,0,CB,OG 

TIB:ILE 86:N,CA,C,0,CB,CG1,CG2,CD1 

TIBlGLU 87:N,CA,C,0,CB,CG,CD,OEl,OE2 
40 TIB:TRP 89:N,CA,C,0,CB,CG,CD1,CD2,NE1,CE2,CE3,CZ2,CZ3,CH2 

TIB:ILE 90:N,CA,C,0,CB,CG1,CG2,CD1 

TIB: PHE 113:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 

TIB:THR 143:N,CA,C,0,CB,0G1,CG2 

TIBrGLY 144:N,CA,C,0 
45 TIB:HIS 145 : N, CA, C, O, CB, CG,ND1,CD2 , CE1,NE2 

TIB:SER 146:N,CA,C,0,CB,OG 

TIB:LEU 147:N,CA,C,0,CB,CG,CD1,CD2 

TIB:GLY 148:N,CA,C,0 

TIBtGLY 149:N,CA,C,0 
50 TIB:ALA 150:N,CA,C,0,CB 

TIB: LEU 151:N,CA,C,0,CB,CG,CD1,CD2 

TIB:ALA 152:N,CA,C,0,CB 

TIB:THR 153:N,CA,C,0,CB,0G1,CG2 

TIB:SER 170:N,CA,C,0,CB,OG 
55 TIB:TYR 171:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,OH 

TIB:GLY 172:N,CA,C,0 

TIB:ALA 173:N,CA,C,0,CB 
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TIB:PRO 174:N,CA,CD,C,0,CB,CG 

TIB:ARG 175 : N, CA, C, O, CB, CG, CD,NE, CZ ,NH1 , NH2 

TIBrVAL 176 : CA, C, O , CB, CGI , CG2 

TIBrlLE 196:N,CA,C,0,CB,CG1,CG2^CD1 
5 TIB:THR 197:N,CA,C,0,CB,0G1,CG2 

TIBiHIS 198:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 

TIBrTHR 199:N,CA,C,0,CB,0G1,CG2 

TIBrASN 200:N,CA,C,0,CB,CG,OD1,ND2 

TIB: ASP 201:N,CA,C,0,CB,CG,OD1,OD2 
10 TIBtlLE 202:N,CA,C,0,CB,CG1,CG2,CD1 

TIBrVAL 203:N,CA,C,O,CB,CGl,CG2 

TIBrPRO 204:N,CA,CD,C,O,CB,CG 

TIBrARG 205:N,CA,C,0,CB,CG,CD,NE,CZ,NH1,NH2 

TIB:LEU 206:N,CA,C,0,CB,CG,CD1^CD2 
15 TIB:TRP 

221:N,CA,C,0,CB,CG,CD1,CD2,NE1,CE2,CE3,CZ2,CZ3,CH2 

TIB:ILE 222:N,CA,C,0,CB,CG1,CG2,CD1 

TIBrTHR 226:N,CA,C,0,CB,0G1,CG2 

TIBrGLY 246:N,CA,C,0 
20 TIB:ASN 247 : N, CA, C, O, CB, CG, ODl ,ND2 

TIB:ASN 248:N,CA,C,0,CB,CG,0D1,ND2 

TIBrGLN 249:N,CA,C,0,CB,CG,CD,0E1,NE2 

TIBrASN 251:N,CA,C,0,CB,CG,0D1,ND2 

TIBrlLE 252:N,CA,C,0,CB,CG1,CG2,CD1 
25 TIBrPRO 253:N,CA,CD,C,0,CB,CG 

TIB: ASP 254:N,CA,C,0,CB,CG,0D1,0D2 

TIBrlLE 255:N,CA,C,0,CB,CG1,CG2,CD1 

TIBrPRO 256:N,CA,CD,C,0,CB,CG 

TIBrALA 257:N,CA,C,0,CB 
30 TIBrHIS 258:N,CA,C,0,CB,CG,ND1,CD2,CE1,NE2 

TIB:LEU 259:N,CA,C,0,CB,CG,CD1,CD2 

TIBrTRP 

260rN,CA,C,O,CB,CG,CDl,CD2,NEl,CE2,CE3,CZ2,CZ3,CH2 

TIBrTYR 261rN,CA,C,0,CB,CG,CDl,CD2,CEl,CE2,CZ,OH 
35 TIBrGLY 263:N,CA,C,0 

TIBrLEU 264:N,CA,C,0,CB,CG,CD1,CD2 

TIBrlLE 265rN,CA,C,0,CB,CGl,CG2,CDl 

TIBrGLY 266rN,CA,C,0 

TIBrTHR 267:N,CA,C,0,CB,0G1,CG2 
40 Subset RESTX: 

restxmole • list 
Subset RESTX r 

NEWMODELr 14, 16, 18-20, 31-34, 36, 38, 40, 48-50, 56-66, 68, 78- 
79 88 91—93 

45 NEWMODEL: 104-106, 120, 136, 225, 227-229, 250, 262, 268 
restxatom. list 
Subset RESTX: 

NEWMODEL:ALA 14 :N, CA,C,0,CB 

NEWMODEL:TYR 16:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ,0H 
50 NEWMODEL:ALA 18 :N,CA, C,0,CB 

NEWMODEL:ALA 19 :N,CA,C,0,CB 

NEWMODELlALA 20:N,CA,C,O,CB 

NEWMODELrGLY 31:N,CA,C,0 

NEWMODEL : THR 32 : N , CA, C , O , CB , OGl , CG2 
55 NEWMODEL:ASN 33 :N, CA, C,0, CB, CG, ODl, ND2 

NEWMODEL: ILE 3 4 : N , CA , C , O , CB , CGI , CG2 , CDl 

NEWMODEL:CYS 36:N,CA,C,0,CB,SG 
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NEWMODEL: 


GLY 


38: 


N,CA,C,0 




NEWMODEL: 


ALA 


40: 


N,CA,C,0,CB 




NEWMODEL: 


ASP 


48: 


.N,CA,C,0,CB,CG,0D1,0D2 




NEWMODEL: 


ALA 


49: 


:N,CA,C,0,CB 


5 


NEWMODEL: 


THR 


50: 


:N,CA,C,0,CB,0G1,CG2 




NEWMODEL: 


GLU 


56: 


:N,CA,C,0,CB,CG,CD,0E1,0E2 




NEWMODEL: 


ASP 


57: 


;N,CA,C,0,CB,CG,ODl,OD2 




NEWMODEL: 


SER 


58: 


:N,CA,C,0,CB,OG 




NEWMODEL: 


GLY 


59: 


;N,CA,C,0 


10 


NEWMODEL: 


VAL 


60: 


:N,CA,C,0,CB,CG1,CG2 




NEWMODEL: 


GLY 


61; 


:N,CA,C,0 




NEWMODEL: 


ASP 


62 


:N,CA,C,0,CB/CG,0D1,0D2 




NEWMODEL: 


VAL 


63 


:N,CA,C,0,CB,CG1,CG2 




NEWMODEL: 


:THR 


64 


:N,CA,C,0,CB,0G1,CG2 


15 


NEWMODEL: 


;GLY 


65 


:n,ca,c,o 




NEWMODEL: 


:PHE 


66 


:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 




NEWMODEL: 


;ALA 


68 


:N,CA,C,0,CB 




NEWMODEL: 


:LEU 


78 


:N,CA,C,0,CB,CG,CD1,CD2 




NEWMODEL: 


:SER 


79 


:N,CA,C,0,CB,OG 


20 


NEWMODEL: 


lASN 


88 


:N,CA,C,0,CB,CG,0D1,ND2 




NEWMODEL: 


GLY 


91 


:N,CA,C,0 




NEWMODEL: 


:ASN 


92 


:N,CA,C,0,CB,CG,0D1,ND2 




NEWMODEL: 


:LEU 


93 


:N,CA,C,0,CB,CG,CD1,CD2 




NEWMODEL: 


:CYS 


104:N,CA,C,O,CB,SG 


25 


NEWMODEL- 


:SER 


105:N,CA,C,O,CB,OG 




NEWMODEL 


:GLY 


106:N,CA,C,0 




NEWMODEL, 


:VAL 


120:N,CA,C,0,CB,CG1,CG2 




NEWMODEL 


:PRO 


13 6:N,CA,CD,C,0,CB,CG 




NEWMODEL 


:GLY 


225:N,CA,C,0 


30 


NEWMODEL 


:LEU 


227:N,CA,C,0,CB,CG,CD1,CD2 




NEWMODEL 


:VAL 


228:N,CA,C,0,CB,CG1,CG2 




NEWMODEL 


:PRO 


229:N,CA,CD,C,0,CB,CG 




NEWMODEL 


:PRO 


250:N,CA,CD,C,O,CB,CG 




NEWMODEL 


:PHE 


262:N,CA,C,0,CB,CG,CD1,CD2,CE1,CE2,CZ 


35 


NEWMODEL 


:CYS 


268:N,CA,C,0,CB^SG 



Example 10 

Providing a lipase variant E87K+D254K 
The. Humicola lanuginosa lipase variant E87K+D254K was 
40 constructed, expressed and purified as described in WO 
92/05249. 

Example 11 

Lipase-S-PEG 15.000 conjugate 
45 The lipase variant E87K+D254K~SPEG conjugate was prepared as 
described in Example 7/ except that the enzyme is the ffujnicola 
lanuginosa lipase variant (E87K+D254K) described in Example 10 
and the polymer is mPEG15,000, 

50 Example 12 
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Iimnunoaenecitv assessed as IaGi_ of lipase variant rD87K+D254K) in 
Balb/C mice 

Balb/c mice were immunized by subcutanuous injection of: 

i) 50 ^1 0.9% (wt/vol) NaCl solution (control group, 8 mice) 
5 (control), 

ii) 50|il 0,9% (wt/vol). NaCl solution containing 25 [ig of protein 
of a Humicola lanuginosa lipase variant (E87K+D254K) (group 1, 

8 mice) (unmodified lipase variant), 

iii) 50% 0>9% (wt/vol) NaCl solution containing a Humicola 

10 lanugoinosa lipase variant substituted in position D87K+D254K and 

coupled to a N-succinimidyl carbonate activated mPEG 15, 000 (group 

2, 8 mice) (lipase-SPEGlS, 000) • 

The amount of protein for each batch was measured by optical 

density measurements. Blood samples (200 ^1) were collected 
15 from the eyes one^week after the immunization, but before the 

following immunization. Ser\im was obtained by blood clothing, 

and centrifugation. 

The IgGi response was determined by use of the Balb/C mice 

IgGi ELISA method as described above. 
20 Results: 

Five weekly immunizations were required to elicit a 
detectable humoral response to the unmodified Humicola 
lanuginosa variant. The antibody titers elicited by the 
conjugate (i.e. lipase-SPEG15, 000 ranged between 960 and 1920, 
25 and were only 2 to 4x lower than the antibody titer of 3840 
that was elicited by unmodified HL8.2-Lipolase (figure to the 
left) . 

The results of the tests are shown in Figure 1 

As will be apparent to those skilled in the art, in the light 
3 0 of the foregoing disclosure, many alterations and modifications 
are possible in the practice of this invention without departing 
from the spirit or scope thereof. Accordingly, the scope of the 
invention is to be construed in accordance with the substance 
defined by the following claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 
(i) APPLICANT: 
5 (A) NAME: Novo Nordisk A/S 

(B) STREET: Novo Alle 

(C) CITY: Bagsveard 
(£) COUNTRY: Denmark 

(F) POSTAL CODE (ZIP): DK-2880 
10 (6) TELEPHONE: +45 4444 8888 

(H) TELEFAX: +45 4449 3256 

(ii) TITLE OF INVENTION: A modified polypeptide 

(iii) NUMBER OF SEQUENCES: 9 
(iv) COMPUTER READABLE FORM: 

15 (A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 
(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 (EPO) 

20 (2) INFORMATION FOR SEQ ID NO: 1: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus sp. PD498, NCIMB No. 40484 
(ix) FEATURE: 
30 (A) NAME/KEY: CDS 

(B) LOCATION:!. .840 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TGG TCA OCG AAT GAC CCT TAC TAT TCT GCT TAC CAG TAT G6A CCA CAA 48 
35 Trp Ser Pro Asn Asp Pro Tyr Tyr Ser Ala Tyr Gin Tyr Gly Pro Gin 

1 5 10 15 ' 

AAC ACC TCA ACC CCT GCT 6CC TGG GAT GTA ACC CGT GGA AGC AGC ACT 96 
Asn Thr Ser Thr Pro Ala Ala Trp Asp Val Thr Arg Gly Ser Ser Thr 
40 20 25 30 

CAA ACG GTG 6CG GTC CTT GAT TCC GGA GT6 GAT TAT AAC CAC CCT GAT 144 
Gin Thr Val Ala Val Leu Asp Ser Gly Val Asp Tyr Asn His Pro Asp 
35 40 45 

45 

CTT GCA AGA AAA GTA ATA AAA GGG TAC GAC TTT ATC GAC AGG GAC AAT 192 
Leu Ala Arg Lys Val lie Lys Gly Tyr Asp Phe He Asp Arg Asp Asn 
50 55 60 

50 AAC CCA ATG GAT CTT AAC GGA CAT GGT ACC CAT GTT 6CC GGT ACT GTT 240 
Asn Pro Met Asp Leu Asn Gly His Gly Thr His Val Ala Gly Thr Val 
65 70 75 80 

GCT GCT GAT ACG AAC AAT GGA ATT GGC GTA GCC GGT ATG GCA CCA GAT 288 
55 Ala Ala Asp Thr Asn Asn Gly He Gly Val Ala Gly Met Ala Pro Asp 

85 90 95 

ACG AAG ATC CTT GCC GTA CGG GTC CTT GAT GCC AAT GGA AGT GGC TCA 336 
Thr Lys He Leu Ala Val Arg Val Leu Asp Ala Asn Gly Ser Gly Ser 
60 100 105 110 

CTT GAC AGC ATT GCC TCA GGT ATC CGC TAT GCT GCT GAT CAA GGG GCA 384 
Leu Asp Ser He Ala Ser Gly He Arg Tyr Ala Ala Asp Gin Gly Ala 
115 120 125 

65 

AAG GTA CTC AAC CTC TCC CTT GGT TGC GAA TGC AAC TCC ACA ACT CTT 432 
Lys Val Leu Asn Leu Ser Leu Gly Cys Glu Cys Asn Ser Thr Thr Leu 
130 135 140 



70 AAG AGT GCC GTC GAC TAT GCA TGG AAC AAA GGA GCT GTA GTC GTT GCT 



480 
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Lys Ser Ala Val Asp Tyr Ala Trp Asn Lys Gly Ala Val Val Val Ala 
145 150 155 160 

OCT GCA GGG AAT GAC AAT GTA TCC CGT ACA TTC CAA CCA OCT TCT TAC 528 
5 Ala Ala Gly Asn Asp Asn Val Ser Arg Thr Phe Gin Pro Ala Ser Tyr 
165 170 175 

CCT AAT GCC ATT GCA GTA GGT GCC ATT GAC TCC AAT GAT CGA AAA GCA 576 
Pro Asn Ala lie Ala Val Gly Ala He Asp Ser Asn Asp Arg Lys Ala 
10 180 185 190 

TCA TTC TCC AAT TAC GGA ACG TGG GTG GAT GTC ACT GCT CCA GGT GTG 624 
Ser Phe Ser Asn Tyr Gly Thr Trp Val Asp Val Thr Ala Pro Gly Val 
195 200 205 



15 



AAC ATA GCA TCA ACC GTT CCG AAT AAT GGC TAC TCC TAC ATG TCT GGT 672 
Asn He Ala Ser Thr Val Pro Asn Asn Gly Tyr Ser Tyr Met Ser Gly 
210 215 220 



20 ACG TCC ATG GCA TCC OCT CAC GTG GCC GGT TTG GCT GCT TTG TTG GCA 720 
Thr Ser Met Ala Ser Pro His Val Ala Gly Leu Ala Ala Leu Leu Ala 
225 230 235 240 

AGT CAA GGT AAG AAT AAC GTA CAA ATC CGC CAG GCC ATT GAG CAA ACC 768 
25 Ser Gin Gly Lys Asn Asn Val Gin He Arg Gin Ala He Glu Gin Thr 
245 250 255 

GCC GAT AAG ATC TCT GGC ACT GGA ACA AAC TTC AAG TAT GGT AAA ATC 816 
Ala Asp Lys He Ser Gly Thr Gly Thr Asn Phe Lys Tyr Gly Lys He 
30 260 265 270 

AAC TCA AAC AAA GCT GTA AGA TAC 840 
Asn Ser Asn Lys Ala Val Arg Tyr 
275 280 

35 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 280 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

45 Trp Ser Pro Asn Asp Pro Tyr Tyr Ser Ala Tyr Gin Tyr Gly Pro Gin 
1 5 10 15 



50 



Asn Thr Ser Thr Pro Ala Ala Trp Asp Val Thr Arg Gly Ser Ser Thr 
20 25 30 

Gin Thr Val Ala Val Leu Asp Ser Gly Val Asp Tyr Asn His Pro Asp 
35 40 45 



Leu Ala Arg Lys Val He Lys Gly Tyr Asp Phe He Asp Arg Asp Asn 
55 50 55 60 

Asn Pro Met Asp Leu Asn Gly His Gly Thr His Val Ala Gly Thr Val 
65 70 75 80 

60 Ala Ala Asp Thr Asn Asn Gly He Gly Val Ala Gly Met Ala Pro Asp 

85 90 95 

Thr Lys He Leu Ala Val Arg Val Leu Asp Ala Asn Gly Ser Gly Ser 
100 105 110 

65 

Leu Asp Ser He Ala Ser Gly He Arg Tyr Ala Ala Asp Gin Gly Ala 
115 120 125 

Lys Val Leu Asn Leu Ser Leu Gly Cys Glu Cys Asn Ser Thr Thr Leu 
70 130 135 140 
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Lys Ser Ala Val Asp Tyr Ala Trp Asn Lys Gly Ala Val Val Val Ala 
145 150 155 160 

5 Ala Ala Gly Asn Asp Asn Val Ser Arg Thr Phe Gin Pro Ala Ser Tyr 
165 170 175 

Pro Asn Ala lie Ala Val Gly Ala lie Asp Ser Asn Asp Arg Lys Ala 
180 185 190 

10 

Ser Phe Ser Asn Tyr Gly Thr Trp Val Asp Val Thr Ala Pro Gly Val 
195 200 205 

Asn He Ala Ser Thr Val Pro Asn Asn Gly Tyr Ser Tyr Met Ser Gly 
15 210 215 220 

Thr Ser Met Ala Ser Pro His Val Ala Gly Leu Ala Ala Leu Leu Ala 
225 230 235 240 

20 Ser Gin Gly Lys Asn Asn Val Gin He Arg Gin Ala He Glu Gin Thr 
245 250 255 

Ala Asp Lys lie Ser Gly Thr Gly Thr Asn Phe Lys Tyr Gly Lys He 
260 265 270 

25 

Asn Ser Asn Lys Ala Val Arg Tyr 
275 280 

(2) XKFOHMATION FOR SEQ ID NO: 3; 
30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
35 (ii) MOLECULE TYPE: protein 

(vi) ORIGINAL SOURCE: 

(B) STRAIN: Bacillus lentus 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

40 Ala Gin Ser Val Pro Trp Gly He Ser Arg Val Gin Ala Pro Ala Ala 

15 10 15 



45 



His Asn Arg Gly Leu Thr Gly Ser Gly Val Lys Val Ala Val Leu Asp 
20 25 30 

Thr Gly He Ser Thr His Pro Asp Leu Asn He Arg Gly Gly Ala Ser 
35 40 45 



Phe Val Pro Gly Glu Pro Ser Thr Gin Asp Gly Asn Gly His Gly Thr 
50 50 55 60 

His Val Ala Gly Thr He Ala Ala Leu Asn Asn Ser He Gly Val Leu 
65 70 75 80 

55 Gly Val Ala Pro Ser Ala Glu Leu Tyr Ala Val Lys Val Leu Gly Ala 

85 90 95 



60 



Ser Gly Ser Gly Ser Val Ser Ser He Ala Gin Gly Leu Glu Trp Ala 
100 105 110 

Gly Asn Asn Gly Met His Val Ala Asn Leu Ser Leu Gly Ser Pro Ser 
115 120 125 



Pro Ser Ala Thr Leu Glu Gin Ala Val Asn Ser Ala Thr Ser Arg Gly 
65 130 135 140 

Val Leu Val Val Ala Ala Ser Gly Asn Ser Gly Ala Gly Ser He Ser 
145 150 155 160 

70 Tyr Pro Ala Arg Tyr Ala Asn Ala Met Ala Val Gly Ala Thr Asp Gin 
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165 170 175 

Asn Asn Asn Arg Ala Ser Phe Ser Gin Tyr Gly Ala Gly Leu Asp lie 
180 185 190 

Val Ala Pro Gly Val Asn Val Gin Ser Thr Tyr Pro Gly Ser Thr Tyr 
195 200 205 

Ala Ser Leu Asn Gly Thr Ser Met Ala Thr Pro His Val Ala Gly Ala 
210 215 220 

Ala Ala Leu Val Lys Gin Lys Asn Pro Ser Trp Ser Asn Val Gin lie 
225 230 235 240 

Arg Asn His Leu Lys Asn Thr Ala Thr Ser Leu Gly Ser Thr Asn Leu 
245 250 255 

Tyr Gly Ser Gly Leu Val Asn Ala Glu Ala Ala Thr Arg 
260 265 

(2) INFORMATION FOR SEQ ID NO: 4: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(vi) ORIGINAL SOURCE: 

(B) STRAIN: Arthromyces ramosus 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

Gin Gly Pro Gly Gly Gly Gly Gly Ser Val Thr Cys Pro Gly Gly Gin 
1 5 .10 15 

. Ser Thr Ser Asn Ser Gin Cys Cys Val Trp Phe Asp Val Leu Asp Asp 
20 25 30 

Leu Gin Thr Asn Phe Tyr Gin Gly Ser Lys Cys Glu Ser Pro Val Arg 
35 40 45 

Lys lie Leu Arg He Val Phe His Asp Ala He Gly Phe Ser Pro Ala 
50 55 60 

Leu Thr Ala Ala Gly Gin Phe Gly Gly Gly Gly Ala Asp Gly Ser He 
65 70 75 80 

He Ala His Ser Asn He Glu Leu Ala Phe Pro Ala Asn Gly Gly Leu 
85 90 95 

Thr Asp Thr He Glu Ala Leu Arg Ala Val Gly He Asn His Gly Val 
100 105 110 

Ser Phe Gly Asp Leu He Gin Phe Ala Thr Ala Val Gly Met Ser Asn 
115 120 125 

Cys Pro Gly Ser Pro Arg Leu Glu Phe Leu Thr Gly Arg Ser Asn Ser 
130 135 140 

Ser Gin Pro Ser Pro Pro Ser Leu He Pro Gly Pro Gly Asn Thr Val 
145 150 155 160 

Thr Ala He Leu Asp Arg Met Gly Asp Ala Gly Phe Ser Pro Asp Glu 
165 170 175 

Val Val Asp Leu Leu Ala Ala His Ser Leu Ala Ser Gin Glu Gly Leu 
180 185 190 

Asn Ser Ala He Phe Arg Ser Pro Leu Asp Ser Thr Pro Gin Val Phe 
195 200 205 
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Asp Thr Gin Phe Tyr He Glu Thr Leu Leu Lys Gly Thr Thr Gin Pro 
210 215 220 

Gly Pro Ser Leu Gly Phe Ala Glu Glu Leu Ser Pro Phe Pro Gly Glu 
5 225 230 235 240 

Phe Arg Met' Arg Ser Asp Ala Leu Leu Ala Arg Asp Ser Arg Thr Ala 
245 250 255 

10 Cys Arg Trp Gin Ser Met Thr Ser Ser Asn Glu Val Met Gly Gin Arg 

260 265 270 



15 



Tyr Arg Ala Ala Met Ala Lys Met Ser Val Leu Gly Phe Asp Arg Asn 
275 280 285 

Ala Leu Thr Asp Cys Ser Asp Val He Pro Ser Ala Val Ser Asn Asn 
290 295 300 



Ala Ala Pro Val He Pro Gly Gly Leu Thr Val Asp Asp He Glu Val 
20 305 310 315 320 

Ser Cys Pro Ser Glu Pro Phe Pro Glu He Ala Thr Ala Ser Gly Pro 
325 330 335 

25 Leu Pro Ser Leu Ala Pro Ala Pro 

340 

(2) INFOBMATION FOR SEQ ID NO: 5: 
(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 876 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
35 (vi) ORIGINAL SOURCE: 

(B) STRAIN: Humicola lanuginosa DSM 4109 
(ix) FEATURE: 

(A) NAME/KEY: sig peptide 

(B) LOCATION :1.. 6? 
40 (ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 67, •876 
(ix) FEATURE: 

(A) NAME/KEY: CDS 
45 (B) LOCATION:!.. 87 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

ATG AGG AGC TCC CTT GTG CTG TTC TTT GTC TCT GCG TGG ACG GCC TTG 48 
Met Arg Ser Ser Leu Val Leu Phe Phe Val Ser- Ala Trp Thr Ala Leu 
50 -22 -20 -15 -10 

GCC AGT OCT ATT CGT CGA GAG GTC TCG CAG GAT CTG TTT AAC CAG TTC 96 
Ala Ser Pro He Arg Arg Glu Val Ser Gin Asp Leu Phe Asn Gin Phe 
-5 15 10 

55 

AAT CTC TTT GCA CAG TAT TCT GCA GCC GCA TAC TGC GGA AAA AAC AAT 144 
Asn Leu Phe Ala Gin Tyr Ser Ala Ala Ala Tyr Cys Gly Lys Asn Asn 
15 20 25 

60 GAT GCC CCA GOT GGT ACA AAC ATT ACG TGC ACG GGA AAT GCC TGC CCC 192 
Asp Ala Pro Ala Gly Thr Asn He Thr Cys Thr Gly Asn Ala Cys Pro 
30 35 40 

GAG GTA GAG AAG GCG GAT GCA ACG TTT CTC TAC TCG TTT GAA GAC TCT 240 
65 Glu Val Glu Lys Ala Asp Ala Thr Phe Leu Tyr Ser Phe Glu Asp Ser 
45 50 55 

GGA GTG GGC GAT GTC ACC GGC TTC CTT GCT CTC GAC AAC ACG AAC AAA 288 
Gly Val Gly Asp Val Thr Gly Phe Leu Ala Leu Asp Asn Thr Asn Lys 
70 60 65 70 
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TTG ATC GTC CTC TCT TTC CGT GGC TCT OGT TCC ATA GAG AAC TGG ATC 336 
Leu He Val Leu Ser Phe Arg Gly Ser Arg Ser He Glu Asn Trp He 
75 80 85 90 

5 

GGG AAT CTT AAC TTC GAC TTG AAA GAA ATA AAT GAC ATT TGC TCC GGC 384 
Gly Asn Leu Asn Phe Asp Leu Lys Glu He Asn Asp He Cys Ser Gly 
95 100 105 

10 TGC AGG GGA CAT GAC GGC TTC ACT TCG TCC TGG AGG TCT GTA GCC GAT 432 
Cys Arg Gly His Asp Gly Phe Thr Ser Ser Trp Arg Ser Val Ala Asp 
110 115 120 

ACG TTA AGG CAG AAG GTG GAG GAT GCT 6TG AGG GAG CAT COC GAC TAT 480 
15 Thr Leu Arg Gin Lys Val Glu Asp Ala Val Arg Glu His Pro Asp Tyr 
125 130 135 

CGC GTG GTG TTT ACC GGA CAT AGC TTG GGT GGT GCA TTG GCA ACT 6TT 528 
Arg Val Val Phe Thr Gly His Ser Leu Gly Gly Ala Leu Ala Thr Val 
20 140 145 150 

GCC GGA GCA GAC CTG CGT GGA AAT GGG TAT GAT ATC GAC GTG TTT TCA 576 
Ala Gly Ala Asp Leu Arg Gly Asn Gly Tyr Asp He Asp Val Phe Ser 
155 160 165 170 



25 



TAT GGC GCC CCC CGA GTC GGA AAC AGG GCT TTT GCA GAA TTC CTG ACC 624 
Tyr Gly Ala Pro Arg Val Gly Asn Arg Ala Phe Ala Glu Phe Leu Thr 
175 180 185 



30 GTA CAG ACC GGC GGA ACA CTC TAC CGC ATT ACC CAC ACC AAT GAT ATT 672 
Val Gin Thr Gly Gly Thr Leu Tyr Arg He Thr His Thr Asn Asp He 
190 195 200 

GTC CCT AGA CTC CCG CCG CGC GAA TTC GGT TAC AGC CAT TCT AGC CCA 720 
35 Val Pro Arg Leu Pro Pro Arg Glu Phe Gly Tyr Ser His Ser Ser Pro 
205 210 215 

GAG TAC TGG ATC AAA TCT GGA ACC CTT GTC CCC GTC ACC CGA AAC GAT 768 
Glu Tyr Trp He Lys Ser Gly Thr Leu Val Pro Val Thr Arg Asn Asp 
40 220 225 230 

ATC GTG AAG ATA GAA GGC ATC GAT GCC ACC GGC GGC AAT AAC CAG CCT 816 
He Val Lys He Glu Gly He Asp Ala Thr Gly Gly Asn Asn Gin Pro 
235 240 245 250 



45 



AAC ATT CCG GAT ATC CCT GCG CAC CTA TGG TAC TTC GGG TTA ATT GGG 864 
Asn He Pro Asp He Pro Ala His Leu Trp Tyr Phe Gly Leu He Gly 
255 260 265 



50 ACA TGT CTT TAG 876 
Thr Cys Leu * 
270 

(2) INFORMATION FOR SEQ ID NO: 6: 
55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 292 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Arg Ser Ser Leu Val Leu Phe Phe Val Ser Ala Trp Thr Ala Leu 
-22 -20 -15 -10 

65 Ala Ser Pro He Arg Arg Glu Val Ser Gin Asp Leu Phe Asn Gin Phe 
-5 1 5 10 

Asn Leu Phe Ala Gin Tyr Ser Ala Ala Ala Tyr Cys Gly Lys Asn Asn 
15 20 25 

70 
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Asp Ala Pro Ala Gly Thr Asn He Thr Cys Thr Gly Asn Ala Cys Pro 
30 35 40 

Glu Val 61u Lys Ala Asp Ala Thr Phe Leu Tyr Ser Phe Glu Asp Ser 
5 45 50 55 

Gly Val Gly Asp Val Thr Gly Phe Leu Ala Leu Asp Asn Thr Asn Lys 
60 65 70 

10 Leu He Val Leu Ser Phe Arg Gly Ser Arg Ser He Glu Asn Trp He 
75 80 85 90 

Gly Asn Leu Asn Phe Asp Leu Lys Glu He Asn Asp He Cys Ser Gly 
95 100 105 

15 

Cys Arg Gly His Asp Gly Phe Thr Ser Ser Trp Arg Ser Val Ala Asp 
110 115 120 

Thr Leu Arg Gin Lys Val Glu Asp Ala Val Arg Glu His Pro Asp Tyr 
20 125 130 135 

Arg Val Val Phe Thr Gly His Ser Leu Gly Gly Ala Leu Ala Thr Val 
140 145 150 

25 Ala Gly Ala Asp Leu Arg Gly Asn Gly Tyr Asp He Asp Val Phe Ser 
155 160 165 170 

Tyr Gly Ala Pro Arg Val Gly Asn Arg Ala Phe Ala Glu Phe Leu Thr 
175 180 185 

30 

Val Gin Thr Gly Gly Thr Leu Tyr Arg He Thr His Thr Asn Asp He 
190 195 200 

Val Pro Arg Leu Pro Pro Arg Glu Phe Gly Tyr Ser His Ser Ser Pro 
35 205 210 215 

Glu Tyr Trp He Lys Ser Gly Thr Leu Val Pro Val Thr Arg Asn Asp 
220 225 230 

40 He Val Lys He Glu Gly He Asp Ala Thr Gly Gly Asn Asn Gin Pro 
235 240 245 250 

Asn He Pro Asp He Pro Ala His Leu Trp Tyr Phe Gly Leu He Gly 
255 260 265 

45 

Thr Cys Leu * 
270 

50 (2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
55 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = •'R28K oligo" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

60 gggatgtaac caagggaagc agcactcaaa eg 32 

(2) INFORMATION FOR SEQ ID NO: 8: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
65 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCa^IPTION: /desc = "R62K oligo" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
5 cgactttatc gataaggaca ataaccc 27 

(2) INFORMATION FOR SEQ ID NO: 9: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc - "R169K oligo" 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



caatgtatcc aaaacgttcc aaccagc 



27 
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Patent Claims 

1. A polypeptide-polymer conjugate having 

a) one or more additional polymeric molecules coupled to the 
5 polypeptide, having been modified in a manner to increase the 

number of attachment groups on the surface of the polypeptide, in 
comparison to the number of attachment groups availaJ^le on the 
corresponding parent polypeptide, and/ or 

b) one or more fewer polymeric molecules coupled to the 
10 polypeptide, having been modified in a manner to decrease the 

number of attachment groups at or close to the functional site(s) 
of the polypeptide, in comparison to the number of attachment 
groups available on the corresponding parent polypeptide. 

2. The conjugate according to claims 1, having 1 to 25, 
15 preferably 1 to 10 additional polymeric molecules coupled to the 

surface of the polypeptide in comparison to the number of 
polymeric molecules of a conjugate prepared from the corresponding 
parent enzyme. 

3. The conjugate according to claims 1 and 2, wherein the 
20 additional attachiaent group (s) is (are) amino groups in the form of 

Lysine residues (s), or carboxylic groups in the form of Aspartic 
acid or Glutamic acid residues. 

4. The conjugate according to any of claims 1 to 3, wherein 
the additional attachment group (s) is (are) prepared by a 

25 conservative substitution of an amino acid residue, such as an 
Arginine to Lysine substitution. 

5. The conjugate according to claims 1 to 3, wherein the 
additional attachment group (s) is (are) prepared by a conservative 
substitution of an amino acid, such as an Aspargine to 

30 Aspartate/Glutamate or a Glutamine to Aspartate/Glutamate 
substitution. 

6. The conjugate according to any of claims 1 to 5, wherein 
the added attachment group is located more than 5 A, preferably 8 
A, especially 10 A from the functional site. 

35 7. The conjugate according to claim 1, having 1 to 25 

preferably 1 to 10 fewer polymeric molecules coupled at or close 
to the functional site of the polypeptide in comparison to the 
number of polymeric molecules of a conjugate prepared on the basis 
of the corresponding parent polypeptide. 
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8. The conjugate according to claim 7, wherein the removed 
attachment group (s) is (are) amino groups in the form of Lysine 
residues (s), or carboxylic groups in the form of Aspartic acid or 
Glutamic acid residues. 
5 9. The conjugate according to any of claims 7 and 8, wherein 
the removed attachment group (s) is (are) prepared by a conservative 
substitution of an amino group, such as Lysine to Arginine 
substitution. 

10. The conjugate according to any of claims 7 to 8, wherein 
10 the removed attachment group (s) is (are) prepared by a conservative 

substitution of a carboxylic group, such as an Aspartate/Glutamate 
to Aspargine or Aspartate/Glutamate to a Glutamine substitution. 

11. The conjugate according to any of claims l to 10, wherein 
the removed attachment group is located within 5 A, preferably 8 

15 A, especially 10 A from the functional site. 

12. The conjugate according to any of claims 1 to 11, wherein 
the attachment groups are broadly spread. 

13. The conjugates according to claims 1 to 12, wherein the 
parent polypeptide moiety of the conjugate has a molecular weight 

20 from 1 to 100 kDa, preferred 15 to 100 kDa. 

14. The conjugate according to claim 13, wherein the parent 
polypeptide moiety of the conjugate has a molecular weight of from 
1 to 35 kDa. 

15. The conjugates according to claim 14, wherein the parent 
25 polypeptide is an enzyme selected from the group of 
Oxidoreductases, including laccases and Superoxide dismutase 
(SOD); Hydrolases, including proteases, especially subtilisins, 
and lipolytic enzymes; Transferases, including Transglutaminases 
(TGases) ; Isomerases, including Protein disulfide Isomerases 
30 (PDI) . 

16. The conjugate according to claim 15, wherein the parent 
enzyme is PD498, Savinase®, BPN", Proteinase K, Proteinase R, 
Subtilisin DY, Lion Y, Rennilase®, JA16, Alcalase® or a Humicola 
lanuginosa lipase, such as Lipolase®. 
35 17. The conjugate according to claim 16, wherein the enzyme 
moiety of the conjugate is a PD498 variant with one or more of the 
following substitutions: R51K, R62K, R121K, R169K, R250K, R28K, 
R190K, P6K, Y7K, S9K, AlOK, YllK, Q12K, D43K, Y44K, N45K, N65K, 
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G87K, I88K, N209K, A211K; N216K, N217K, G218K, Y219K, S220K, 
Y221K, G262K. 

18. The conjugate according to claim 17, with one of the 
following mutations: R28K+R62K, R28K+R169K, R62K + R169K, 
5 R28K+R69K+R169K. 

19. The conjugate according to claim 16, wherein the enzyme 
moiety of the conjugate is a Savinase® variemt with one or more of 
the following substitutions: RIOK, R19K, R45K, R145K, R170K, 
R186K, R247K, K94R, P5K, P14K, T22K, T38K, H39K, P40K, L42K, 

10 L75K, N76K, L82K, P86K, S103K, V104K, S105K, A108K, A133K, 
T134K, L135K, Q137K, N140K, N173K, N204K, Q206K, G211K, S212K, 
T213K, A215K, S216K, N269K. 

20. The conjugate according to claim 16, wherein the enzyme 
moiety of the conjugate is a Hujnicola lanuginosa lipase variant 

15 with one or more of the following substitutions: 

R133K,R139K,R160K,R179K,R209K,R118K,R125K,A18K,G31K,T32K, 
N33K,G38K,A40K,D48K,T50K,E56K,D57K,S58K,G59K,V60K,G61K,D62K, 
T64K,L78K,E87K,N88K,G91K,N92K,L93K,S105K,G106K,V120K,P136K,G225 
K,L227K,V228K,P229K,P250K,D254K,F262K. 

20 21. The conjugate according to claim 20 with the following 

mutations E87K+D254K. 

22. The conjugate according to any of claims 1 to 21, wherein 
the polymeric molecules coupled to the polypeptide have a 
molecular weight from 1 to 60 JcDa, especially 1-35 kDa, especially 

25 3 to 25 kDa. 

. . 23. The conjugate according to claim 22, wherein the poly- 
meric molecule is selected from the group comprising a natural or 
synthetic homo- and heteropolymers, selected from the group of the 
synthetic polymeric molecules including Branched PEGs, poly-vinyl 

30 alcohol (PVA) , poly-carboxyl acids, poly-(vinylpyrolidone) and 
poly-D,L-amino acids, or natural occurring polymeric molecules 
including dextrans, including carboxymethyl-dextrans, arid 
celluloses such as methylcellulose, carboxymethylcellulose, 
ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, and 

35 hydrolysates of chitosan, starches, such as hydroxyethyl-starches, 
hydroxypropyl-starches, glycogen, agarose, guar gum, inulin, 
pullulans, xanthan gxims, carrageenin, pectin and alginic acid. 

24. A method for preparing improved polypeptide-polymer 



' wo 98/35026 



115 



PCT/DK98/00046 



conjugates comprising the steps of: 

a) identifying amino acid residues located on the surface of the 
3D structure of the parent polypeptide in question, 

b) selecting target amino acid residues on the surface of said 3D 
5 structure of said parent polypeptide to be mutated, 

c) i) substituting or inserting one or more amino acid residues 
selected in step b) with an amino acid residue having a suitable 
attachment group, and/ or 

ii) substituting or deleting one or more amino acid residues 
10 selected in step b) at or close to the functional site, 

d) coupling polymeric molecules to the mutated polypeptide. 

25. The method according to claim 24, wherein the 
identification of amino acid residues located on the surface on 
the polypeptide referred to in step a) are performed by a computer 

15 program analyzing the 3D structure of the parent polypeptide in 
question. 

26. The method according to claim 24, wherein step b) 
comprises selecting Arginine or Lysine residues on the siirface of 
the parent polypeptide. 

20 27. The method according to claim 24, wherein one or more 
Arginine residues identified in step b) is (are) substituted with a 
Lysine residue (s) in step c) • 

28. The method according to claims 27, wherein the 
substituted Arginine residues have a distance of more than 5 A, 

25 preferably 8 A , especially 10 A from the functional site. 

29. The method according to any of claims 24 to 28, wherein 
the polypeptide prepared in step . c) is coupled to polymeric 
molecules . 

30. Use of the conjugate in claims 1 to 23 for reducing the 
30 allergenicity of industrial products. 

31. Use of the conjugate in claims 1 to 23 for reducing the 
immunogenicity of pharmaceuticals. 

32. A composition comprising a conjugate of any of claims 1 
to 23 and ftirther comprising ingredients used in industrial 

35 products. 

33. The composition according to claim 32, wherein the 
industrial product is a detergent, such as a laundry, dish wash or 
hard surface cleaning product, or a food or feed product. 
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34, The composition according to claim 32, comprising a 
conjugate of any of claims 1 to 22 and fxirther ingredients used in 
skin care products. 

35. A composition comprising a conjugate of any of claims 1 
5 to 23 and further comprising ingredients used in pharmaceuticals. 
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